Hello everyone!
I am a 3rd year Computer Science student, and recently I started exploring the world of AI research, especially in:
- Vision Language Models (VLMs)
- Spatial AI
- Semantic Segmentation
- Multimodal Learning
- Scene Understanding
At first, research looked very complicated to me because of research papers, mathematical concepts, and large AI architectures. But after reading papers and experimenting with datasets and models, I realized research is mainly about curiosity and solving problems.
What I Am Currently Exploring
Recently, I have been learning about:
- How VLMs understand images and text together
- Spatial reasoning in AI systems
- Scene graph understanding
- Data curation and annotation pipelines
- Reducing hallucinations in multimodal models
My Goal
I want to work on advanced AI systems that can understand the real world more accurately, especially for applications like:
- Robotics
- Autonomous systems
- Smart surveillance
- Human-AI interaction
What I Learned So Far
One important thing I learned is:
Research is not about knowing everything.
It is about continuously learning and improving ideas.
Technologies I Am Using
- Python
- PyTorch
- Hugging Face
- OpenCV
- Transformers
Final Thoughts
This is just the beginning of my research journey, and I am excited to keep learning, building, and sharing my progress with the community.
If you are also starting in AI research, feel free to connect with me!













