Multimodal Data-Loader
Motivation
Many recent research papers have discussed fusing multiple modalities of data. We need a general-purpose data loader that can take various forms of image or text data sampled at various time frequencies.
Example Papers
CrossVIVIT
Realm
media_context_len:
media_context_time_len: Maximum time span in which to attempt to retrieve the media_context_len
media_types (List): The types of media present in the multimedia file
get_media: An abstract method that subclasses implement
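The parameters and abstract method listed above could be sketched roughly as follows. This is only an illustration, not an existing implementation: the class names MultimodalLoader and InMemoryLoader, the (timestamp, media_type, payload) record layout, the get_media signature, and the reading of media_context_len as "maximum number of media items to return" are all assumptions made for this example.

```python
from abc import ABC, abstractmethod
from datetime import datetime, timedelta
from typing import Any, List, Tuple


class MultimodalLoader(ABC):
    """Hypothetical base class. Only the three parameter names and
    get_media come from the page; everything else is assumed."""

    def __init__(
        self,
        media_context_len: int,
        media_context_time_len: timedelta,
        media_types: List[str],
    ):
        if media_context_len <= 0:
            raise ValueError("media_context_len must be positive")
        # Assumption: the maximum number of media items to return.
        self.media_context_len = media_context_len
        # Maximum time span to look back when retrieving media.
        self.media_context_time_len = media_context_time_len
        # The types of media this loader should return.
        self.media_types = list(media_types)

    @abstractmethod
    def get_media(self, timestamp: datetime) -> List[Any]:
        """Return the media context for a given time-series timestamp."""


class InMemoryLoader(MultimodalLoader):
    """Toy subclass that serves media from a list held in memory."""

    def __init__(self, records: List[Tuple[datetime, str, Any]], **kwargs):
        super().__init__(**kwargs)
        # Keep records ordered by time so the newest items come last.
        self.records = sorted(records, key=lambda r: r[0])

    def get_media(self, timestamp: datetime) -> List[Any]:
        start = timestamp - self.media_context_time_len
        in_window = [
            payload
            for ts, media_type, payload in self.records
            if start <= ts <= timestamp and media_type in self.media_types
        ]
        # Keep at most media_context_len of the most recent items.
        return in_window[-self.media_context_len:]
```

A subclass for real data would replace the in-memory list with file or database access, while keeping the same windowing contract: look back at most media_context_time_len and return at most media_context_len items of the requested types.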