Harshit Agarwal

Aspiring Data Scientist & Machine Learning Engineer

LinkedIn | GitHub

About

Highly motivated and results-driven student with a strong foundation in Data Science and Machine Learning, seeking to leverage robust analytical and programming skills to solve complex problems. Proven ability to develop and deploy predictive models, optimize systems with real-time data, and derive actionable insights from diverse datasets, as demonstrated through impactful internships and academic projects. Eager to contribute to innovative data-driven solutions within a dynamic technology environment.

Work Experience

Data Science Intern

Toast

May 2025 - Dec 2025

Remote, Any, US

Developed and deployed advanced predictive models for credit risk, integrating complex financial metrics and leveraging cloud-based data tools to enhance decision-making.

  • Engineered and deployed a robust PD (Probability of Default) model, significantly improving credit risk prediction accuracy by incorporating seasonality logic.
  • Implemented account-level filtering mechanisms, refining risk assessments and enabling more precise financial evaluations.
  • Utilized Python for model development and integrated with AWS data tools (S3, Athena) for efficient data processing and scalable model deployment.
  • Applied advanced predictive modeling techniques, including LightGBM, to analyze complex datasets and generate actionable insights for credit risk management.

ML Intern

JITSIE IIT Madras

Oct 2024 - Jan 2025

Chennai, Tamil Nadu, IN

Designed and implemented a machine learning model to dynamically optimize data center cooling, enhancing energy efficiency and system performance.

  • Developed a machine learning model that dynamically optimized data center cooling, reducing overcooling by analyzing real-time sensor data (temperature, server workload).
  • Replaced static CRAC (Computer Room Air Conditioner) systems with a dynamic ML-driven approach, leading to potential energy savings and improved operational efficiency.
  • Utilized ensemble modeling and data visualization techniques to process and interpret complex sensor data, ensuring precise cooling adjustments.
  • Contributed to data generation and analysis, providing critical insights for the continuous improvement and scalability of the cooling optimization system.

Data Analyst

Jindal Steel and Power

Jun 2024 - Aug 2024

Raigarh, Chhattisgarh, IN

Performed comprehensive data analysis on industrial datasets to identify cost-optimal solutions and gain hands-on experience in industry data analytics.

  • Analyzed high-dimensional belt-drive datasets, incorporating critical factors such as grade, material strength, and cost to identify performance bottlenecks.
  • Engineered features from raw data, enhancing the predictive power of analytical models for industrial applications.
  • Generated data-driven recommendations for cost-optimal replacements, contributing to potential operational savings and efficiency improvements.
  • Gained practical experience in data visualization, ETL processes, SQL querying, and data warehousing within a heavy industry context.

Education

Computer Science and Engineering

National Institute of Technology, Durgapur

N/A

Sep 2021 - May 2025

Durgapur, West Bengal, IN

Courses

  • Linear Algebra
  • Probability and Statistics
  • Theory of Algorithms
  • Database Management Systems
  • Operating Systems
  • System Designing
  • Computer Architecture

Volunteer

Core Team Member

Recstacy 2023 Organizing Committee

Jan 2023 - Mar 2023

Durgapur, West Bengal, IN

Spearheaded the successful organization and execution of Recstacy 2023, a large-scale cultural festival, demonstrating strong leadership and logistical capabilities.

  • Spearheaded the planning and execution of Recstacy 2023, a large-scale cultural festival, overseeing all aspects from concept to delivery.
  • Managed and coordinated diverse teams, optimizing workflows and ensuring seamless collaboration across various functional areas.
  • Directed event planning, stage management, and logistics, contributing to the successful hosting of multiple events and performances.
  • Demonstrated strong leadership and problem-solving skills in a high-pressure environment, ensuring the festival's smooth operation and positive attendee experience.

Projects

Automated Investment Thesis Generator

Jan 2025 - Feb 2025

Created an AI-powered web application designed to automate the analysis of startup pitch decks, providing scored insights and comprehensive reports for investment evaluation.

Solar Energy Predictive Model

Sep 2024 - Oct 2024

Developed a machine learning model to accurately predict solar energy output using historical weather data, demonstrating expertise in data preprocessing, model selection, and hyperparameter tuning.

Awards

A Grade in Advanced Algorithms

National Institute of Technology, Durgapur

Mar 2024

Achieved an 'A' grade in the Advanced Algorithms course during the 3rd year of study, demonstrating strong analytical and problem-solving skills.

Kaggle Competition Participant

Kaggle

Jun 2023

Actively participated in various Kaggle Competitions, applying machine learning and data science techniques to real-world datasets and continuously enhancing skills.

Olympiad Silver Medalist

Mathematics Olympiad Committee

Jan 2019

Awarded a Silver Medal in the Mathematics Olympiad, recognizing exceptional mathematical aptitude and problem-solving abilities at a national level.

Skills

Programming Languages

  • Python
  • C
  • C++
  • SQL
  • HTML
  • CSS
  • TypeScript
  • Node.js
  • Express.js

Data Science & ML

  • Machine Learning
  • AI
  • Predictive Modeling
  • Credit Risk Prediction
  • Ensemble Modeling
  • Data Visualization
  • Data Processing
  • ETL
  • Data Warehousing
  • Feature Engineering
  • Model Tuning
  • Natural Language Processing (NLP)
  • Scikit-learn
  • LightGBM

Libraries & Frameworks

  • Numpy
  • Pandas
  • Random Forest Regressor
  • SGDRegressor
  • Dummy Regressor
  • VSCode
  • Git

Databases & Cloud

  • DBMS
  • PostgreSQL
  • AWS S3
  • AWS Athena

Tools & Concepts

  • Power BI
  • Excel
  • Debugging
  • Design Principles
  • Data Engineering
  • Linear Algebra
  • Algorithms
  • Probability and Statistics
  • Object-Oriented Programming (OOPs)

Soft Skills

  • Accountability
  • Collaboration
  • Communication
  • Proactive
  • Problem-solving