A highly motivated and results-oriented Computer Vision Ph.D. student with a deep passion for advancing the field of artificial intelligence. My research focuses on building multimodal representations and understanding human activities from ego/exocentric perspectives, addressing key challenges for autonomous agents and AI in general. I have extensive experience with multimodal large language models for video captioning and question answering and a keen interest in view-invariant video representation learning. I’m particularly committed to exploring how to effectively bridge the gap between representations of different modalities while preserving their unique characteristics.
In addition to my research expertise, I possess a strong engineering foundation honed through academic and industry experiences. Proficient in Python, C++, and CUDA, I excel at rapidly prototyping and implementing innovative ideas. I’m eager to leverage my skills and knowledge to contribute to cutting-edge research and development in this dynamic field.
PhD in Computer Science, 2021 - ongoing
Sapienza University
MSc in Computer Science, 2018 - 2020
Sapienza University
BSc in Business Computer Science (a.k.a. Data Science), 2015 -2018
University of Tirana
5+ Years
5+ Years
4+ Years
3+ Years
5+ Years
Conducting research on the following areas of computer vision:
Main responsibilities include:
Main responsabilities include:
Main Responsabilities include:
Contact Me