About
Driven by an insatiable curiosity for technology, I find myself captivated by its dynamic and transformative power. My passion is particularly ignited by Data Science, which I perceive as a potent tool for deciphering intricate data and unveiling profound insights that can shape our future.

Data Scientist & Web Developer.
Unleashing the power of data, one algorithm at a time.
- Age: 24
- Website: https://github.com/shubhamtamhane
- Email: shubhamtamhane2000@gmail.com
- City: Bloomington, IN, USA
- Visa: F1 OPT
- Degree: Master's in Data Science
- University: University of Rochester
- Available Start Date: Immediately
As a Master's student specializing in Data Science, I am passionately delving deeper into the intricacies of data analysis and interpretation. My academic journey in this field is a natural progression from my previous education in Information Technology, which laid a solid foundation for my understanding of the digital world. I am eager to apply my theoretical knowledge and technical skills to practical scenarios, thereby enhancing my learning experience. My goal is to continually evolve as a data science professional, leveraging my academic background and personal interest in technology. I am excited about the opportunities and challenges that lie ahead in my journey of personal and professional growth.
Skills
I am proficient in a wide array of Data Science disciplines, with a strong ability to manipulate, analyze, and interpret complex data sets. My expertise extends to various data science tools and methodologies, enabling me to transform raw data into actionable insights and strategic solutions
- Programming Languages: Python, R, C, C++, Java, Spark
- Database: SQL, MySQL, PostgreSQL, OracleSQL, NoSQL, MongoDB, Google Firebase
- Data Manipulation and Visualization: Tableau, PowerBI, MS Excel
- No Code Software: JMP, Dataiku, Seeq, Git
- Framework and Libraries: Sklearn, OpenCV, Tensorflow, Keras, Pandas, Numpy, ggplot2, pytorch
- Machine Learning Methods: Linear Regression, Logistic Regression, Decision Trees, Random Forest, Naive Bayes, K-Nearest Neighbors, Support Vector Machines, Artificial Neural Networks, Deep Learning, Gradient Boosting algorithms (GBM, XGBoost, LightGBM), Principal Component Analysis, ARIMA, SARIMA, State Space Models, Holt-Winters Method, Exponential Smoothing.
- Web Technologies: HTML5, CSS3, Django, Flask, Nodejs, , JavaScript, Express, Flutter
Resume
Kindly peruse the following for a comprehensive overview of my educational and professional experience.
Education
Master of Science - Data Science
Aug 2022 - Dec 2023
University of Rochester, Rochester, NY, USA
- GPA: 3.95/4
- Recipient of 40% merit scholarship
- Secured 2nd position in the 2022 UR Biomedical Data Science Hackathon
- Relevant Courses: Time Series, Data mining, Statistics, NLP, Machine Learning, Big Data
Bachelor of Engineering - Information Technology
Aug 2018 - May 2022
Ramrao Adik Institute of Techology, Mumbai, MH, India
- GPA: 3.73/4; CGPA: 8.95/10
- Hackathon winner for creating video conferencing web application. Was invited next year to give guest lecture
- Relevant Courses: Time Series, Data mining, Statistics, NLP, Machine Learning, Big Data
Software Experience
Software Intern
Sept 2022 - June 2023
URMC - Department of Neuroscience, New York, NY
- Implemented a system using the "pydicom" library that verifies if over 10,000 medical files in "dcm" format, anonymized by a specific function, are present in the output, all within 60 seconds.
- Developed a web service in Flask having JWT authentication in combination with caching features to reduce loading time of large JSON files to under 10 seconds and deployed on Docker
Software Intern
Department of Information Technology, RAIT, India
- Engineered a multi-user video communication application utilizing Express and Node.js.
- Implemented competitive programming practices, resulting in a 10-50% performance optimization in C/C++.
Jun 20-Jul 20
Dec 19-Jan 20
Data Science Experience
Data scientist
Mar 2024 - Present
Indiana University, Bloomington, IN
- Incorporated Microsoft SQL Server in combination with Python for in-depth analysis of healthcare datasets, exceeding 5M records, to derive insights from Electronic Health Record (EHR) data
- Leveraged T-SQL to clean and preprocess extensive healthcare datasets, ensuring data integrity for a study on patient outcomes, which was conducted in compliance with HIPAA regulations
- Performed comprehensive survival analysis to assess disease diagnosis over 10 years of clinical data, integrating demographic and treatment variables to provide detailed reports
- Developed polynomial regression models in R, analyzing 25 years of previous birth rates and disease spread while factoring in seasonality and other pertinent social variables in USA
- Conducted a meta-analysis of over 25 studies, including results from A/B testing, and visualized the findings using forest plots to summarize review results
Data Science Co-op
Jun 2023 - Dec 2023
Regeneron Pharmaceuticals, New York, NY
- Developed a demand forecasting algorithm in JMP, which was used to predict protein demands from a central repository, resulting in a time saving of considerable hours per week.
- Instituted an inventory management system that effectively prevented significant amount of material from being wasted each quarter.
- Formulated a predictive maintenance system to ensure proper upkeep of MFCs and related systems saving large amounts
Data science intern
May 2021 - Jun 2021
Exposys Data Labs, India
- Performed knowledge mining and segmented data by implementing k-means clustering with 85% accuracy
Data science intern
Oct 2020 - Jan 2021
Sciffer Analytics Pvt Ltd, India
- Developed and annotated image datasets by extracting information from Google, using tools such as “labelimg”. This process enabled the training of a machine learning model that successfully identified more than 30 objects within a span of 3 months.
- Utilizing the YOLO v3 model, a deep learning classifier model was built, achieving an accuracy rate of 80%.
Campus Leadership Activities
Event Head
Apr 2020 - May 2020
Ramrao Adik Institute of Technology, RAIT-ACM, India
- Organized and mentored Dark Raptor event making it a success by selling more than 200 tickets, accounting for 20% of event’s revenue
Event Organizer
Apr 2019 - May 2019
Ramrao Adik Institute of Technology, ITSA, India
- Collaborated with 3 teammates and contributed to 30% of event’s revenue as the Event organizer by selling more than 60 tickets. Also maintained track of all events in the fest with the use of Microsoft Excel
Portfolio
- All
- Data Science
- Web Development

Emotion Recognition Using Deep Convolutional Neural Networks.
Python, Machine Learning, EDA
A deep convolutional neural network (DCNN) was created and used to identify the mood of the user based on his facial expression. Accuracy of over 83.9% was achieved.

Predicting and Analysing the Viral Fragments of Songs
Python, Machine Learning, EDA
Utilized dynamic time warping to compare extracted MFCC features from songs, and applied a weighted SVM classifier. Achieved 100% recall and 86% accuracy due to presence of data imbalance.

Dynamic QA Generator for Research Papers
Python, Large Language Models, Data Preparation, OpenAI API
Developed a QA model to aid efficient comprehension of research papers by summarizing relevant information in the form of Q&A. Fine-tuned T5 models on OpenAI-modified QASPER dataset with 1.5k+ papers for question generation and answer generation task

Student Performance in Exams
Python, Machine Learning, EDA
EDA of features affecting student's marks and prediction of marks is done achieving 90% accuracy

Mushroom classification
Python, Machine Learning, EDA
Data analysis and usage of Random Forest Classifier on this famous dataset.

Newsletter Signup
MongoDB, Express, Nodejs
It lets me keep you updated with the daily happenings of the world

ToDo List
MongoDB, Express, Nodejs
A Todolist webapp which allows to add,edit and delete tasks on the todolist.
Open to Work
I am available to work for the following roles
Data Scientist
I leverage statistical methods and machine learning to extract insights from complex data sets and drive strategic decisions.
Data Analyst
I interpret data, analyze results using statistical techniques, and provide ongoing reports to help guide business decisions.
Data Engineer
I design, construct, install, test and maintain highly scalable data management systems, ensuring that all data systems meet business requirements and industry practices.
Machine Learning Engineer
I design and build intelligent systems that learn from and make decisions or predictions based on data.
Web Developer
I create and maintain websites, ensuring they're both user-friendly and functional across various platforms.
Software Engineer
I design, develop, and maintain software systems, ensuring they're efficient, reliable, and meet the needs of users.
Contact
I would greatly appreciate the opportunity to connect and engage in a more personal dialogue if you are in the vicinity. It would be a pleasure to meet and discuss our mutual interests further. Feel free to reach out directly at shubhamtamhane2000@gmail.com
Location:
Bloomington, IN
Email:
shubhamtamhane2000@gmail.com
School Email:
stamhane@ur.rochester.edu