Alert - Open to work for internships and projects.

Siddhesh Dosi
  • Home
  • Blogs [Handy Tools and Scripts]
    • Linux commands
    • HPC servers
    • Git
    • Conda and Python environment
    • Trending Blogs
  • Projects and Publications
  • Hackathons and Competitions
  • RESUME/CV
  • Education/Teaching
  • Achievements

Table of Contents

  • Timeline
  • Top 10 Repositories
Categories
All (5)
BBL deduplication (1)
Data Curation (1)
Data Science (1)
Decision Tree [Classifier and Regressor] (1)
Deep Learning (1)
Distributed training (1)
Ensemble Learning (1)
Gradient descent (1)
HPC server (1)
Huggingface (1)
LLM pretraining (1)
Linear Regreesion (1)
Machine Learning (1)
Mistral (1)
NLP (1)
Point Cloud (1)
Point Cloud Network (1)
Python (2)
Pytorch (1)
Scientific domain data (1)
XLSTM (1)
fine-tuning (1)
tokenization (1)
Important

Open to new opportunities in the field of NLP/CV and Machine Learning.

I am glad that you have visited my blog. I hope you find it interesting and informative!

Siddhesh Dosi

Email LinkedIn Github Huggingface

Welcome to my page! I am Siddhesh Dosi, a post graduate from IIT Gandhinagar in Computer Science and Engineering under the mentorship of Prof. Dr. Mayank Singh.

Current Research projects:

Working on the completion task of 3D partial point cloud in vision domain.
- Experimenting with various PCN and GAN based architectures for completion problem in 3D point cloud.

Explore More

Current work-profile:

Working as full time ML/AI lead engineer at Neuroreef Labs
- Developing and deploying user specific AI-Copilots for various medical residents in USA.

Explore More

Timeline

2024-12-16 | GEN-AI SDE - Full-time | Chapter Apps Inc

I have joined chapter apps as a full-time Gen AI SDE engineer.
It is a merger of Got-It-AI and ChapterVitamins. I will be working on the development of the AI-based chatbots for different clients including but not limited to (Aditya Birla Capitals)[https://www.adityabirlacapital.com], JCB India, etc. There are a lot of improvements to be done in the current architecture and I am excited to work on them.

2024-07-01 | Joined as full-time ML/AI Team Lead | Neuroreef Labs

I have joined Neuroreef Labs as a full-time ML/AI Team Lead of a team of 5.
I am excited to work with a team of talented individuals and contribute to the development of cutting-edge AI solutions.
We provide custom user specific Medical Template for final notes like (Neurology, Cardiology, etc).

2024-06-29 | Convocation! Graduated with M.Tech in Computer Science and Engineering | IIT Gandhinagar

I have successfully completed my M.Tech in Computer Science and Engineering from IIT Gandhinagar with a CGPA of 9.33/10.

My major acdamics courses are:
- Machine Learning
- Deep Learning
- Natural Language Processing
- Algorithms
- Artificial Intelligence
- Computational Neuroscience

2024-06-17 | M.Tech Thesis Defense | IIT Gandhinagar

I have successfully defended my M.Tech thesis on “Optimizing Smaller Variant of LLMs” under the guidance of Dr. Mayank Singh.

Trained a mistral architecture based mini llm of 1B parameters on a web scrapped and processed Hindi dataset, Ganga-1B

Curated and processed the scientific papers from arXiv to train a mini scientific LLM.

Reduced the bibliographic biasness, as in Galactica, by deduplication and filtering of the dataset.

2023-10-05 | Joined as ML/AI Engineer | Neuroreef Labs

Developed and designed the core architecture of two main products
> MedAura.ai : An evidence-based medical chat bot with minimal document search time with unqiue table creation approach.
> Carecortex.ai : A full-fledged AI - copilot for doctors to assist them in their daily routine.
| - Conversation Transcription
| - Evidence-based insights generation
| - Final notes generation
| - Medical coding (ICD-10, CPT, etc)

Used aws lambda to invoke apis triggered on s3 bucket events for real-time processing of the data, independent of the browser live session.

2023-09-29 | Got placed as Software Developer Analyst| ICG-TECH CITI

From back-2-back interviews in smartsense, citi and broadridge, I have been selected in citi as a Software Developer Analyst.

Got a great experience in the interview process and learned a lot from the interviewers.

Though the toughest interview was of smartsense, I found citi had very thoughful and interesting interview process.

2023-08-17 | Hackathon Winner | ThirdAI Corp

Participated in the India-level hackathon at ThirdAI Corp and won the first prize among 70+ teams.

Developed a in-depth contexual based document search engine for Google Drive, TEGD (ThirdAI Engine for Google Drive) using NeuralDB architecture.

01-05-2023 | Started ML/AI Internship | ThirdAI Corp

Joined ThirdAI Corp as a Machine Learning/AI Intern.

Trained many ThirdAI’s BOLT based models in distributed environment from ParamAnanta supercomputer on
- Stackoverflow dataset
- Amazom 3M review dataset
- Arxiv abstracts dataset
- Food UDT dataset

Delivered a high performance model on Amazon-3M with Metrics (P@1=0.695, R@5=0.797, R@10=0.814) in its category

2022-07-18 | Started M.Tech in Computer Science and Engineering | IIT Gandhinagar

Started my journey as a M.Tech student in Computer Science and Engineering at IIT Gandhinagar. I am excited to learn and explore the field of AI and Machine Learning.

Got offers (in the first round only) from
- IIT Gandhinagar (accepted)
- IIT Jodhpur
- IIT Indore
- IIT Gawhati (spot round)
- IIIT Delhi
- IIIT Bangalore

2022-06-20 | Interview Call from IB (ACIO-2) | MHA

Based on my GATE score, I have been shortlisted for the interview round in IB as Assistant Central Intelligence Officer Grade-II/Executive. I went through the final round of the interview but unfortunately couldn’t make it to the final list.

2022-03-15 | Scored 640 in GATE-22 (AIR 1012) CSE | GATE-22

I have scored 640 in GATE-22 (AIR 1012) in Computer Science and Engineering.
I am happy with my performance and looking forward to applying for M.Tech programs.

This was my second attempt and am happy with my performance and looking forward to applying for M.Tech programs.

2021-12-01 | Completed B.Tech in Information Technology | SKIT & MG, Jaipur

Graduated with a B.Tech in Information Technology with 80%.
This marked the end of an exciting chapter filled with learning, growth, and exploration of the tech world.

2021-03-22 | Got placed in InTimeTec as Software Developer | InTimeTec

Secured a position as a Software Developer at InTimeTec.
It was an exciting moment to transition from a student to a professional in the software industry.

I learnt a lot of concepts related to GO-lang, go-routines and Transact-SQL/SQL Server.

2021-03-19 | GATE Score 464 (AIR 6028) CSE | GATE-21

Achieved a GATE score of 464 in Computer Science and Engineering, securing an All India Rank of 6028.
This was my first attempt at GATE, which set the foundation for my future academic journey.

2020-02-18 | Semifinalist in e-Yantra Robotics Competition | IIT Bombay

Became a semifinalist in the prestigious e-Yantra Robotics Competition organized by IIT Bombay. Worked on solving real-world problems using robotics and embedded systems, gaining valuable hands-on experience.

Designed and implemented a working prototype of supply - line follower robot using Arduino and sensors.
Used Computer vision to process the live imaging of the bot for the navigation and real-time decision making.

2017-08-03 | Started B.Tech in Information Technology | SKIT & MG, Jaipur

Embarked on a journey into the world of technology by enrolling in B.Tech in Information Technology at SKIT & MG, Jaipur.
It was the beginning of my passion for computer science and engineering.

Top 10 Repositories

Point Cloud Completion

Deep Learning
XLSTM
Point Cloud
Point Cloud Network
Using the XLSTM encoder and PCN decoder to complete the partial point cloud object.
Oct 22, 2024
README.md

Hindi-vani-LLM

Data Science
NLP
Data Curation
HPC server
Distributed training
tokenization
Data processing and pretraining of LLM on Hindi language data from scratch.
May 20, 2024
README.md

Mini-sci-LLM

Pytorch
Mistral
LLM pretraining
Scientific domain data
fine-tuning
Huggingface
BBL deduplication
Data processing and pretraining of LLM on scientific domain data from scratch.
Mar 21, 2024
README.md

ThirdAI Google Drive Search Engine

Python
Machine Learning
A neuralDB based indepth contextual search engine for Google Drive.
Aug 5, 2023
README.md

ML algorithms from scratch

Python
Decision Tree [Classifier and Regressor]
Linear Regreesion
Ensemble Learning
Gradient descent
Implementing Machine learning algorithms from scratch in Python.
Jan 4, 2023
README.md
No matching items
Back to top