Abaka AI · 2 months ago
Research Intern (Video)
Abaka AI focuses on advancing video understanding technologies and is seeking a Research Intern to contribute to its projects. The role involves building datasets for video understanding, evaluating video-language models, and running experiments on modeling efficiency and data optimization.
Data Collection and Labeling, Machine Learning, Natural Language Processing
Responsibilities
Build and refine datasets for video understanding and multimodal reasoning, including temporal QA, action recognition, event prediction, and spatial understanding
Evaluate video-language models (Video-LLMs) and audio-visual datasets, including those derived from large-scale sources such as HowTo100M
Conduct experiments analyzing long-context modeling efficiency, compression strategies, and data optimization techniques
Contribute to benchmark standardization efforts and assist in setting up public leaderboards for evaluation and comparison
Qualifications
Required
Strong background in computer vision, video analytics, or multimodal learning
Proficient in building and managing video data processing pipelines
Understanding of transformer-based temporal models (e.g., TimeSformer, VideoGPT)
Preferred
Experience with video-QA, action recognition, or multimodal reasoning datasets
Relevant publications in top-tier conferences
Company
Abaka AI
Abaka AI is a leading AI company committed to becoming the data partner for the artificial intelligence industry.
H1B Sponsorship
Abaka AI has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not shown)
Trends of Total Sponsorships: 2025 (2)
Funding
Current Stage
Growth Stage
Company data provided by Crunchbase