2023 - 2024
I conducted
research in the field of computer vision and it was focused on integration of state-of-the-art computer vision algorithms with path planning to develop a system for autonomous vehicle, presented paper in DJ-ICACTA-23 and published it in IEEExplore journal.
Also at VCET-Solecthon we won ESVC3000+, SEVC2023 championships.
2022 - 2023
I was Perception (Computer Vision) lead at VCET-Solecthon, focused on building robust perception mechanism where objective was to drive a car on road (or any drivable space) without the need of explicit lane markings. Researched and worked on many computer vision tasks including in-house data labelling, improving object detection, dataset curation, lane detection, semantic segmentation, SLAM, etc.
Won SEVC2022 championship and also secured second rank at OpenCV-Core competition
Simultaneously, I took CS50AI course by Harvard University, worked on many projects on AI domain. Explored DL and CNN via course by Andrew Ng.
2021 - 2022
Took CS50X by Harvard University, worked on many projects throughout the course and built and deployed
Dronacharya webapp.
Along the way I joined an autonomous-solar-electric vehicle team at college named VCET-Solecthon as autonomus member, at that time objective was to drive a car in constrained environment.
Also worked in aeronautics & aerospace team Airnova as R&D member on drone building and gesture controlled interface for it.
2020 - 2021
Admission in BTech at the University of Mumbai, Vidyavardhini's college of engineering and technology with a major in computer science and a honours in artificial intelligence and machine learning.
Explored the computer science domain and took various courses on web dev. This is where I first got intoduced to deep learning by my brother.
Education
B.Tech in Computer Science Engineering from Mumbai University, 2020-2024
Honours in Artificial Intelligence and Machine Learning from Mumbai University, 2021-2024
Deep Learning Specialization, 2022-2024
Experience
Full-time Engineer at Zeus Learning, specializing in AI/ML applications, Researching and Working on Document Layout Analysis, Multimodal Neural Networks, State-of-the-Art Transformer Architectures, RAG-based Chatbots, NLP from scratch, 2024 - Present
Team Lead in Computer Vision and Autonomous Vehicles research in VCET Solecthon tech team, 2022-2024
Member of Research & Development team in Airnova
Responsibilities included research, modification, and application of data science and data analytics prototypes. Created and constructed methods and plans for machine learning, employing test findings for statistical analysis and model improvement. Trained and retrained ML systems and models, improved current ML frameworks and libraries, and created machine learning applications in accordance with customer needs.
Coordinated with development teams to understand application needs, utilized Python to create scalable code, performed application testing and bug fixing, and integrated storage methods for data. Designed and implemented high-performance, low-latency applications while working closely with front-end programmers.
Projects
- Autopilot: AI-based perception system for autonomous vehicles, integrating state-of-the-art object detection and depth estimation techniques.
- VisioBlend: Denoising diffusion model for sketch-based image generation, improving LDDPM with human-drawn sketches and color information.
- Dronacharya: College recommendation system based on exam scores, using JEE and MHT-CET data to recommend colleges and branches.
- Gesture-X: Computer vision-based interface for system interaction using hand gestures, supporting 20 gestures with high accuracy.
- Takshak: Farmer assistant project with features like crop recommendation, yield prediction, disease prediction, and weed detection using ML and DL models.
- Web Automation Suite: Automated various web tasks using Python, leveraging libraries such as Selenium and BeautifulSoup for scraping, and automating workflows. Integrated with APIs and databases for seamless data handling.
Skills
- Data Science
- Computer Vision
- Deep Learning
- Full-Stack Development
- Research
- Numpy
- Matplotlib
- Scikit Learn
- Caffe
- Caffe2
- Darknet
- Pytorch
- Tensorflow
- AWS
- Machine Learning
- Python
- Multimodal Neural Networks
- Object Detection
- Segmentation
- CUDA/CuDNN
- Data Pipelines
- Artificial Intelligence
- Document Layout Analysis
- Natural Language Processing (NLP)
- Chatbots
- Neural Networks
- Autonomous Systems
- Data Pipeline Development
- C
- C++
- JavaScript
- Matlab
- Simulink
- CNN
- RNN
- FPN
- Feature Extraction
- Architecture Design
- Mathematics and Statistics
- Exploratory Data Analysis (EDA)
- Data Collection
- Data Preprocessing
- SLAM
- Visual Odometry
- OpenVINO
- Optimization of ML/DNN Models
- Neural Processing Engines (SNPE, TIDL)
- Pandas
- Pickle
- ElementTree Parsing (XML)
- Jinja Template
- Git