Shubham Tamhane

About

Driven by an insatiable curiosity for technology, I find myself captivated by its dynamic and transformative power. My passion is particularly ignited by Data Science, which I perceive as a potent tool for deciphering intricate data and unveiling profound insights that can shape our future.

Data Scientist & Web Developer.

Unleashing the power of data, one algorithm at a time.

  • Age: 24
  • Website: https://github.com/shubhamtamhane
  • Email: shubhamtamhane2000@gmail.com
  • City: Bloomington, IN, USA
  • Visa: F1 OPT
  • Degree: Master's in Data Science
  • University: University of Rochester
  • Available Start Date: Immediately

As a Master's student specializing in Data Science, I am passionately delving deeper into the intricacies of data analysis and interpretation. My academic journey in this field is a natural progression from my previous education in Information Technology, which laid a solid foundation for my understanding of the digital world. I am eager to apply my theoretical knowledge and technical skills to practical scenarios, thereby enhancing my learning experience. My goal is to continually evolve as a data science professional, leveraging my academic background and personal interest in technology. I am excited about the opportunities and challenges that lie ahead in my journey of personal and professional growth.

Skills

I am proficient in a wide array of Data Science disciplines, with a strong ability to manipulate, analyze, and interpret complex data sets. My expertise extends to various data science tools and methodologies, enabling me to transform raw data into actionable insights and strategic solutions.

  • Programming Languages: Python, R, C, C++, Java, Spark
  • Database: SQL, MySQL, PostgreSQL, OracleSQL, NoSQL, MongoDB, Google Firebase
  • Data Manipulation and Visualization: Tableau, Power BI, MS Excel
  • No-Code Software: JMP, Dataiku, Seeq, Git
  • Frameworks and Libraries: scikit-learn, OpenCV, TensorFlow, Keras, Pandas, NumPy, ggplot2, PyTorch
  • Machine Learning Methods: Linear Regression, Logistic Regression, Decision Trees, Random Forest, Naive Bayes, K-Nearest Neighbors, Support Vector Machines, Artificial Neural Networks, Deep Learning, Gradient Boosting algorithms (GBM, XGBoost, LightGBM), Principal Component Analysis, ARIMA, SARIMA, State Space Models, Holt-Winters Method, Exponential Smoothing
  • Web Technologies: HTML5, CSS3, Django, Flask, Node.js, JavaScript, Express, Flutter

Resume

The following is an overview of my educational and professional experience.

Education

Master of Science - Data Science

Aug 2022 - Dec 2023

University of Rochester, Rochester, NY, USA

  • GPA: 3.95/4
  • Recipient of 40% merit scholarship
  • Secured 2nd position in the 2022 UR Biomedical Data Science Hackathon
  • Relevant Courses: Time Series, Data Mining, Statistics, NLP, Machine Learning, Big Data

Bachelor of Engineering - Information Technology

Aug 2018 - May 2022

Ramrao Adik Institute of Technology, Mumbai, MH, India

  • GPA: 3.73/4; CGPA: 8.95/10
  • Won a hackathon for creating a video conferencing web application; was invited the following year to give a guest lecture
  • Relevant Courses: Time Series, Data Mining, Statistics, NLP, Machine Learning, Big Data

Software Experience

Software Intern

Sept 2022 - June 2023

URMC - Center for Advanced Brain Imaging and Neurophysiology, New York, NY

  • Created a distributed ETL pipeline to process over 10,000 DICOM files, reducing ingestion time by 50% by leveraging Python (PySpark) for metadata extraction and storing structured data in MySQL.
  • Orchestrated automated data validation workflows, enabling ingestion of new DICOM files by employing Apache Airflow to enforce data integrity checks before storage.
  • Containerized the data pipeline using Docker, enabling cross-platform execution and reducing deployment time by 80%.

Software Intern

Department of Information Technology, RAIT, India

  • Jun 2020 - Jul 2020: Engineered a multi-user video communication application utilizing Express and Node.js.
  • Dec 2019 - Jan 2020: Implemented competitive programming practices, resulting in a 10-50% performance optimization in C/C++.

Data Science Experience

Data Scientist

Mar 2024 - Present

Indiana University, Bloomington, IN

  • Used Microsoft SQL Server with Python to analyze healthcare datasets exceeding 5M records and derive insights from Electronic Health Record (EHR) data.
  • Enhanced data quality by applying feature engineering, outlier detection, and missing value imputation using advanced data preprocessing techniques in Python (Pandas, NumPy).
  • Developed a classification model for disease prediction with a recall of 85% using XGBoost with scikit-learn.
  • Collaborated with researchers to predict survival activity, improving prediction reliability by 30% with deep learning architectures in PyTorch trained on patients’ food intake and activity records.

Data Science Co-op

Jun 2023 - Dec 2023

Regeneron Pharmaceuticals, New York, NY

  • Implemented time series forecasting to predict customer demand for a complex inventory management problem, using multiple approaches including statistical and deep learning methods.
  • Deployed a web app built with Python Dash that leverages an MLOps workflow on cloud infrastructure to give end users real-time, up-to-date data, forecasts, customer analysis, and model maintenance options, contributing significantly to cost optimization.
  • Led the development of a maintenance analysis system, optimizing the upkeep of MFCs and related systems, which resulted in substantial monthly savings.
  • Adopted JIRA for task tracking and Confluence for documentation, adhering to the Agile/Scrum methodology.

Data Science Intern

Oct 2020 - Jan 2021

Sciffer Analytics Pvt Ltd, India

  • Managed the development of image datasets for information extraction from Google over 3 months using the LabelImg tool, enabling a computer vision model to recognize over 30 distinct objects.
  • Employed the YOLOv3 model to build a deep learning classifier, attaining an accuracy rate of 80%.

Campus Leadership Activities

Event Head

Apr 2020 - May 2020

Ramrao Adik Institute of Technology, RAIT-ACM, India

  • Organized and mentored the Dark Raptor event, selling more than 200 tickets that accounted for 20% of the event’s revenue.

Event Organizer

Apr 2019 - May 2019

Ramrao Adik Institute of Technology, ITSA, India

  • As Event Organizer, collaborated with 3 teammates and contributed 30% of the event’s revenue by selling more than 60 tickets. Also tracked all events in the fest using Microsoft Excel.

Portfolio

Emotion Recognition Using Deep Convolutional Neural Networks

Python, Machine Learning, EDA

Built a deep convolutional neural network (DCNN) to identify the user's mood from their facial expression, achieving an accuracy of over 83.9%.

Predicting and Analysing the Viral Fragments of Songs

Python, Machine Learning, EDA

Used dynamic time warping to compare MFCC features extracted from songs, and applied a weighted SVM classifier. Achieved 100% recall and 86% accuracy in the presence of data imbalance.

Dynamic QA Generator for Research Papers

Python, Large Language Models, Data Preparation, OpenAI API

Developed a QA model to aid efficient comprehension of research papers by summarizing relevant information in the form of Q&A. Fine-tuned T5 models on an OpenAI-modified QASPER dataset with 1.5k+ papers for the question generation and answer generation tasks.

Student Performance in Exams

Python, Machine Learning, EDA

Performed EDA of the features affecting students' marks and predicted marks with 90% accuracy.

Mushroom classification

Python, Machine Learning, EDA

Data analysis and application of a Random Forest classifier on the well-known mushroom dataset.

Campus recruitment in R

R, ggplot2, EDA

Data visualization and exploratory data analysis using R.

Newsletter Signup

MongoDB, Express, Node.js

A newsletter signup app that lets me keep you updated with the daily happenings of the world.

ToDo List

MongoDB, Express, Node.js

A to-do list web app that lets users add, edit, and delete tasks.

Open to Work

I am available to work in the following roles:

Data Scientist

I leverage statistical methods and machine learning to extract insights from complex data sets and drive strategic decisions.

Data Analyst

I interpret data, analyze results using statistical techniques, and provide ongoing reports to help guide business decisions.

Data Engineer

I design, construct, install, test and maintain highly scalable data management systems, ensuring that all data systems meet business requirements and industry practices.

Machine Learning Engineer

I design and build intelligent systems that learn from and make decisions or predictions based on data.

Web Developer

I create and maintain websites, ensuring they're both user-friendly and functional across various platforms.

Software Engineer

I design, develop, and maintain software systems, ensuring they're efficient, reliable, and meet the needs of users.

Contact

If you are in the vicinity, I would greatly appreciate the opportunity to meet in person and discuss our mutual interests. Feel free to reach out directly at shubhamtamhane2000@gmail.com.

Location:

Bloomington, IN

School Email:

stamhane@ur.rochester.edu
