people | Amir Aghdam

---
layout: about
title: About
permalink: /
subtitle: Researcher in VLMs and Computer Vision @ Temple University

profile:
  align: right
  image: prof_pic.png
  image_circular: false # crops the image to make it circular
  more_info: >
    <p>📍 Philadelphia, PA, USA</p>

selected_papers: false # includes a list of papers marked as "selected={true}"
social: true # includes social icons at the bottom of the page

announcements:
  enabled: true # includes a list of news items
  scrollable: true # adds a vertical scroll bar if there are more than 3 news items
  limit: 5 # leave blank to include all the news in the _news folder

latest_posts:
  enabled: false
  scrollable: true # adds a vertical scroll bar if there are more than 3 new posts items
  limit: 3 # leave blank to include all the blog posts
---

Hey, thanks for stopping by! 👋

I’m a Master’s student in the CS Department at Temple University, finishing my degree in Summer 2025. My research interests include Vision-Language Models (VLMs), Multimodal Learning, and Computer Vision.

My current research centers on zero-shot adaptation of VLMs, with a particular emphasis on fine-grained video understanding that leverages the open-set recognition power of image-language models. I’m especially interested in how we can harness the capabilities of LLMs and VLMs responsibly, equipping them with effective workflows to solve high-impact problems.

Previously, I worked on active fine-tuning of vision foundation models like DINO, and I bring over two years of hands-on research experience in image segmentation, active learning, and VLMs.

I’m always open to new ideas, collaborations, or just a good conversation. Feel free to reach out! 📬