Shubham Tamhane

I'm

About

Driven by an insatiable curiosity for technology, I find myself captivated by its dynamic and transformative power. My passion is particularly ignited by Data Science, which I perceive as a potent tool for deciphering intricate data and unveiling profound insights that can shape our future.

Data Scientist & Web Developer.

Unleashing the power of data, one algorithm at a time.

  • Age: 24
  • Website: https://github.com/shubhamtamhane
  • Email: shubhamtamhane2000@gmail.com
  • City: Bloomington, IN, USA
  • Visa: F1 OPT
  • Degree: Master's in Data Science
  • University: University of Rochester
  • Available Start Date: Immediately

As a Master's student specializing in Data Science, I am passionately delving deeper into the intricacies of data analysis and interpretation. My academic journey in this field is a natural progression from my previous education in Information Technology, which laid a solid foundation for my understanding of the digital world. I am eager to apply my theoretical knowledge and technical skills to practical scenarios, thereby enhancing my learning experience. My goal is to continually evolve as a data science professional, leveraging my academic background and personal interest in technology. I am excited about the opportunities and challenges that lie ahead in my journey of personal and professional growth.

Skills

I am proficient in a wide array of Data Science disciplines, with a strong ability to manipulate, analyze, and interpret complex data sets. My expertise extends to various data science tools and methodologies, enabling me to transform raw data into actionable insights and strategic solutions

  • Programming Languages:   Python, R, C, C++, Java, Spark
  • Database:   SQL, MySQL, PostgreSQL, OracleSQL, NoSQL, MongoDB, Google Firebase
  • Data Manipulation and Visualization:   Tableau, PowerBI, MS Excel
  • No Code Software:   JMP, Dataiku, Seeq, Git
  • Framework and Libraries:   Sklearn, OpenCV, Tensorflow, Keras, Pandas, Numpy, ggplot2, pytorch
  • Machine Learning Methods:   Linear Regression, Logistic Regression, Decision Trees, Random Forest, Naive Bayes, K-Nearest Neighbors, Support Vector Machines, Artificial Neural Networks, Deep Learning, Gradient Boosting algorithms (GBM, XGBoost, LightGBM), Principal Component Analysis, ARIMA, SARIMA, State Space Models, Holt-Winters Method, Exponential Smoothing.
  • Web Technologies:  HTML5, CSS3, Django, Flask, Nodejs, , JavaScript, Express, Flutter

Resume

Kindly peruse the following for a comprehensive overview of my educational and professional experience.

Education

Master of Science - Data Science

Aug 2022 - Dec 2023

University of Rochester, Rochester, NY, USA

  • GPA: 3.95/4
  • Recipient of 40% merit scholarship
  • Secured 2nd position in the 2022 UR Biomedical Data Science Hackathon
  • Relevant Courses:  Time Series, Data mining, Statistics, NLP, Machine Learning, Big Data

Bachelor of Engineering - Information Technology

Aug 2018 - May 2022

Ramrao Adik Institute of Techology, Mumbai, MH, India

  • GPA: 3.73/4; CGPA: 8.95/10
  • Hackathon winner for creating video conferencing web application. Was invited next year to give guest lecture
  • Relevant Courses:  Time Series, Data mining, Statistics, NLP, Machine Learning, Big Data

Software Experience

Software Intern

Sept 2022 - June 2023

URMC - Department of Neuroscience, New York, NY

  • Implemented a system using the "pydicom" library that verifies if over 10,000 medical files in "dcm" format, anonymized by a specific function, are present in the output, all within 60 seconds.
  • Developed a web service in Flask having JWT authentication in combination with caching features to reduce loading time of large JSON files to under 10 seconds and deployed on Docker

Software Intern

Department of Information Technology, RAIT, India

    Jun 20-Jul 20
  • Engineered a multi-user video communication application utilizing Express and Node.js.
  • Dec 19-Jan 20
  • Implemented competitive programming practices, resulting in a 10-50% performance optimization in C/C++.

Data Science Experience

Data scientist

Mar 2024 - Present

Indiana University, Bloomington, IN

  • Incorporated Microsoft SQL Server in combination with Python for in-depth analysis of healthcare datasets, exceeding 5M records, to derive insights from Electronic Health Record (EHR) data
  • Leveraged T-SQL to clean and preprocess extensive healthcare datasets, ensuring data integrity for a study on patient outcomes, which was conducted in compliance with HIPAA regulations
  • Performed comprehensive survival analysis to assess disease diagnosis over 10 years of clinical data, integrating demographic and treatment variables to provide detailed reports
  • Developed polynomial regression models in R, analyzing 25 years of previous birth rates and disease spread while factoring in seasonality and other pertinent social variables in USA
  • Conducted a meta-analysis of over 25 studies, including results from A/B testing, and visualized the findings using forest plots to summarize review results

Data Science Co-op

Jun 2023 - Dec 2023

Regeneron Pharmaceuticals, New York, NY

  • Developed a demand forecasting algorithm in JMP, which was used to predict protein demands from a central repository, resulting in a time saving of considerable hours per week.
  • Instituted an inventory management system that effectively prevented significant amount of material from being wasted each quarter.
  • Formulated a predictive maintenance system to ensure proper upkeep of MFCs and related systems saving large amounts

Data science intern

May 2021 - Jun 2021

Exposys Data Labs, India

  • Performed knowledge mining and segmented data by implementing k-means clustering with 85% accuracy

Data science intern

Oct 2020 - Jan 2021

Sciffer Analytics Pvt Ltd, India

  • Developed and annotated image datasets by extracting information from Google, using tools such as “labelimg”. This process enabled the training of a machine learning model that successfully identified more than 30 objects within a span of 3 months.
  • Utilizing the YOLO v3 model, a deep learning classifier model was built, achieving an accuracy rate of 80%.

Campus Leadership Activities

Event Head

Apr 2020 - May 2020

Ramrao Adik Institute of Technology, RAIT-ACM, India

  • Organized and mentored Dark Raptor event making it a success by selling more than 200 tickets, accounting for 20% of event’s revenue

Event Organizer

Apr 2019 - May 2019

Ramrao Adik Institute of Technology, ITSA, India

  • Collaborated with 3 teammates and contributed to 30% of event’s revenue as the Event organizer by selling more than 60 tickets. Also maintained track of all events in the fest with the use of Microsoft Excel

Portfolio

  • All
  • Data Science
  • Web Development

Emotion Recognition Using Deep Convolutional Neural Networks.

Python, Machine Learning, EDA

A deep convolutional neural network (DCNN) was created and used to identify the mood of the user based on his facial expression. Accuracy of over 83.9% was achieved.

Predicting and Analysing the Viral Fragments of Songs

Python, Machine Learning, EDA

Utilized dynamic time warping to compare extracted MFCC features from songs, and applied a weighted SVM classifier. Achieved 100% recall and 86% accuracy due to presence of data imbalance.

Dynamic QA Generator for Research Papers

Python, Large Language Models, Data Preparation, OpenAI API

Developed a QA model to aid efficient comprehension of research papers by summarizing relevant information in the form of Q&A. Fine-tuned T5 models on OpenAI-modified QASPER dataset with 1.5k+ papers for question generation and answer generation task

Student Performance in Exams

Python, Machine Learning, EDA

EDA of features affecting student's marks and prediction of marks is done achieving 90% accuracy

Mushroom classification

Python, Machine Learning, EDA

Data analysis and usage of Random Forest Classifier on this famous dataset.

Campus recruitment in R

R, ggplot2, EDA

Data visualization and Exploratory Data Analysis using R

Newsletter Signup

MongoDB, Express, Nodejs

It lets me keep you updated with the daily happenings of the world

ToDo List

MongoDB, Express, Nodejs

A Todolist webapp which allows to add,edit and delete tasks on the todolist.

Open to Work

I am available to work for the following roles

Data Scientist

I leverage statistical methods and machine learning to extract insights from complex data sets and drive strategic decisions.

Data Analyst

I interpret data, analyze results using statistical techniques, and provide ongoing reports to help guide business decisions.

Data Engineer

I design, construct, install, test and maintain highly scalable data management systems, ensuring that all data systems meet business requirements and industry practices.

Machine Learning Engineer

I design and build intelligent systems that learn from and make decisions or predictions based on data.

Web Developer

I create and maintain websites, ensuring they're both user-friendly and functional across various platforms.

Software Engineer

I design, develop, and maintain software systems, ensuring they're efficient, reliable, and meet the needs of users.

Contact

I would greatly appreciate the opportunity to connect and engage in a more personal dialogue if you are in the vicinity. It would be a pleasure to meet and discuss our mutual interests further. Feel free to reach out directly at shubhamtamhane2000@gmail.com

Location:

Bloomington, IN

School Email:

stamhane@ur.rochester.edu

Loading
Your message has been sent. Thank you!