Multimodal Data-Loader

Owned by Isaac Godfried

May 07, 2024

Motivation

A lot of recent papers have discussed fusing multiple modalities of data together. We need a general-purpose data-loader that can take various forms of image or text data sampled at various time frequencies.

Example Papers

CrossVIVIT

Realm

media_context_len:
media_context_time_len: Maximum time span over which to attempt to retrieve the media_context_len
media_types (List): The types of media present in the multimedia file

Abstract method: get_media
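The parameters and the abstract method above could fit together roughly as in the following sketch. Only the names media_context_len, media_context_time_len, media_types and get_media come from this page. The class names MultimodalLoader and InMemoryLoader, the get_media signature, the record layout, and the reading of media_context_len as "maximum number of media items to return per type" are all assumptions made for illustration, not a settled design.

```python
from abc import ABC, abstractmethod
from datetime import datetime, timedelta
from typing import Any, Dict, List, Tuple


class MultimodalLoader(ABC):
    """Hypothetical base class; only the parameter names and get_media come from the page."""

    def __init__(
        self,
        media_context_len: int,
        media_context_time_len: timedelta,
        media_types: List[str],
    ) -> None:
        # ASSUMPTION: maximum number of media items returned per media type.
        self.media_context_len = media_context_len
        # Maximum time span to look back when retrieving media.
        self.media_context_time_len = media_context_time_len
        # Media types present in the source, e.g. ["image", "text"].
        self.media_types = media_types

    @abstractmethod
    def get_media(self, timestamp: datetime) -> Dict[str, List[Any]]:
        """Return media for each type in the window ending at `timestamp` (assumed signature)."""


class InMemoryLoader(MultimodalLoader):
    """Toy subclass backed by a list of (timestamp, media_type, payload) tuples."""

    def __init__(self, records: List[Tuple[datetime, str, Any]], **kwargs: Any) -> None:
        super().__init__(**kwargs)
        # Sort oldest first so the most recent items sit at the end.
        self.records = sorted(records, key=lambda record: record[0])

    def get_media(self, timestamp: datetime) -> Dict[str, List[Any]]:
        earliest = timestamp - self.media_context_time_len
        result: Dict[str, List[Any]] = {}
        for media_type in self.media_types:
            in_window = [
                payload
                for ts, kind, payload in self.records
                if kind == media_type and earliest <= ts <= timestamp
            ]
            # Keep at most media_context_len of the most recent items.
            start = max(0, len(in_window) - self.media_context_len)
            result[media_type] = in_window[start:]
        return result
```

Under these assumptions, a forecasting step at time t would call get_media(t) and receive, per media type, up to media_context_len items no older than media_context_time_len. Media sampled at a lower frequency than the time series simply yields fewer items.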
