Uppsala Vision, Language and Learning – Part of The Beijer Laboratory for Artificial Intelligence Research, Uppsala University

Research Mission

Build fundamental AI/ML methods for computer vision and language modeling to address societal challenges.

Research Interests

Our research is highly interdisciplinary and collaborative. Our interests include vision-language models, large language models, computer vision, text/image/video classification, text recognition (OCR and handwritten text), human action analysis, multispectral imaging, and NLP tasks.

Research Projects

Multimodal deep learning
Vision-language(-action) models
Multispectral handwritten text and palimpsests modelling
Large Language Models and applications in computer vision

News

March 2025: Ekta Vats is appointed to the WASP–Diversity and Inclusion Group.
March 2025: We welcome Tianru Zhang as a Postdoc in the group!
Jan. 2025: WASP Affiliation for PhD student project on Multimodal Deep Learning.
Oct. 2024: UU-MISHA is ready! We built a cost-effective multispectral imaging (MSI) system to reveal hidden text in manuscripts, partially funded by Kjell och Märta Beijers Stiftelsen. We thank team MISHA at Rochester Institute of Technology for the collaboration.
Sept. 2024: We welcome Robin Hollifeldt as our PhD student!
May 28, 2024: Ekta Vats was promoted to Docent in Computerised Image Processing. Docent Lecture: Introduction to Large Language Models in Image Analysis: Theory and Applications.
New funding from the UU Graduate School in Cybersecurity for the project: Large language models-powered social robots in cybersecurity applications (CYBERBOT). PI: Ginevra Castellano, Co-PIs: Ekta Vats, Katie Winkle and Boel Nelson.
Ekta Vats’ interview with Beijerstiftelsen: 3 frågor till nya Beijerforskaren Ekta Vats (3 questions for the new Beijer Researcher Ekta Vats)
Dec. 2023: Women in Data Science Sweden (WiDS) mentorship program wrap-up! Ekta Vats served as a mentor.
Oct. – Dec. 2023: Till Grutschus from Technical University of Munich joined us on an exchange semester. Project: Human fall detection on untrimmed videos using a large foundational video-understanding model.
Oct. 5, 2023: Beijerforskardagen
Oct. 4, 2023: Raphaela M. Heil defended her thesis titled Document Image Processing for Handwritten Text Recognition. Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts.

Hiring!

[Closed] Postdoc position in Multimodal Deep Learning (Deadline: Dec. 9).
[Closed] PhD position: PhD student in Machine Learning and Computer Vision (Deadline: Aug. 12).
[Closed] PhD position: PhD student in social robotics with focus on large language models and cybersecurity (Deadline: May 23).
[Closed] Postdoc position: Postdoctoral position in Deep Learning with a focus on Vision-Language Models (Deadline: May 14).
[Closed] PhD position: PhD student in Machine Learning with a focus on Vision-Language Models (Deadline: March 28).