Hello everyone!

I am a 3rd year Computer Science student, and recently I started exploring the world of AI research, especially in:

Vision Language Models (VLMs)
Spatial AI
Semantic Segmentation
Multimodal Learning
Scene Understanding

At first, research looked very complicated to me because of research papers, mathematical concepts, and large AI architectures. But after reading papers and experimenting with datasets and models, I realized research is mainly about curiosity and solving problems.

What I Am Currently Exploring

Recently, I have been learning about:

How VLMs understand images and text together
Spatial reasoning in AI systems
Scene graph understanding
Data curation and annotation pipelines
Reducing hallucinations in multimodal models

My Goal

I want to work on advanced AI systems that can understand the real world more accurately, especially for applications like:

Robotics
Autonomous systems
Smart surveillance
Human-AI interaction

What I Learned So Far

One important thing I learned is:

Research is not about knowing everything.
It is about continuously learning and improving ideas.

Technologies I Am Using

Python
PyTorch
Hugging Face
OpenCV
Transformers

Final Thoughts

This is just the beginning of my research journey, and I am excited to keep learning, building, and sharing my progress with the community.

If you are also starting in AI research, feel free to connect with me!

ai #machinelearning #research #computervision

# Starting My Journey into AI Research 🚀

What I Am Currently Exploring

My Goal

What I Learned So Far

Technologies I Am Using

Final Thoughts

ai #machinelearning #research #computervision

Tags

Author

Stats

Published

You Might Also Like

How Are Developers Actually Using AI At Work?

An LLM API call, in 4 GIFs

Why does AI forget what you said (and how to fix it)

AI Agents Are Great at 80% of Our Code. The Other 20% Is Why We Still Need Seniors.

The Quiet AI War Inside Your Browser

Toward a Standard Model for Agent Memory