CV

You can go through my CV for a overview of my academic journey, professional experiences, personal projects and significant accomplishments. For a comprehensive overview, you can download the full PDF version of my resume by clicking on the button above. If you’re interested in knowing more about my work, please do explore other sections of this website or reach out to me directly through email or Linkedin.

Basics

Name Sailesh Dwivedy
Label Master's Student in Computer Science
Email sadw2186@colorado.edu
Phone +1-650-476-5385
Url https://github.com/saileshdwivedy30
Summary Motivated Computer Science graduate with a 4.0 GPA, specializing in AI, Machine Learning, Natural Language Processing, AWS and GCP. I have gained extensive experiences in research, demonstrating improvements in model accuracy and efficiency along with implementation and deployment of cutting-edge ML solutions.

Education

Projects

  • Research Project: Knowledge Distillation Optimization (Submitted to CVPR 2025)
    • We developed a novel layer selection strategy for Feature Knowledge Distillation (FKD) by analyzing the functional roles of layers to optimize teacher-student model alignment.
    • Improved accuracy by up to 15% over SOTA methods and increased training efficiency by 80%, using datasets CIFAR-10, CIFAR-100, Tiny ImageNet, and vision model architectures like VGG, ResNet, Transformers.
  • Research Project: Distributed DL Training Cost Prediction
    • Designing a data-driven framework to predict training costs for distributed deep learning models on Vertex AI.
    • Collecting and curating data on key parameters, including model hyperparameters, resource usage (CPU, GPU, memory, disk, network), instance configurations, and training cost.
    • Preparing for model development by identifying relevant features and experimenting with regression models (e.g., Random Forest, XGBoost) to uncover patterns in resource utilization and optimize cost predictions.
  • Project: Slurred Speech (Dysarthria) Detection System
    • Utilized the Torgo dataset to develop a system for detecting slurred speech (dysarthria) for medical diagnostics.
    • Achieved a 93% accuracy with classical machine learning models (Random Forest, LightGBM) and obtained promising classification results: Precision: 93%, Recall: 91%, F1-Score: 92% (macro average).
    • Currently building an NLP-based model leveraging ASR transcriptions and transformer-based models, such as BERT, to better capture nuanced speech impairments.

Work

  • 2024.08 - Present
    Research Assistant, Image and Video Computing Lab
    University of Colorado, Boulder
    Worked on Knowledge Distillation in Computer Vision.
    • Continued my work on enhancing performance by 15% over state-of-the-art methods and increased training efficiency by 80%.
  • 2024.05 - 2024.08
    Summer Research Intern, Image and Video Computing Lab
    University of Colorado, Boulder
    Worked on Knowledge Distillation in Computer Vision.
    • Used data-driven techniques to enhance performance by 15% over SOTA.
  • 2023.02 - 2023.08
    Research Associate
    Kalinga Institute of Industrial Technology
    Worked with Prof. Suresh Satapathy on imbalanced classification techniques.
    • Researched SMOTE, sampling, and Tomek Links for imbalanced classification problems.
  • 2020.09 - 2023.02
    Data Science Team Lead
    Highradius Technologies Private Limited
    Led a team of 10 in implementing and deploying ML models for enterprise clients.
    • Designed and deployed Smart Email Composer, using techniques of Information Extraction like Event Extraction and NER combined with Generative AI to classify events and extract key entities from emails, enabling tailored email responses and enhancing account workflows.
    • Owned end-to-end deployment of the AI solution, reducing response time by 40%, delivering business impact.
    • Developed imbalance classification financial models for Deduction Validity Predictor for Fortune 500 clients with 90% accuracy, cutting research time by 25% and boosting dollar amount recovery by 12%.
    • Developed ML models for regression use case of Payment Date Prediction, cutting Days Sales Outstanding by 30%.
    • Automated the ML pipeline leading to 33% deployment timeline reduction resulting in faster value realization.
    • Led a team of 10 on advanced analytics projects, driving innovation through collaboration with cross-functional teams.
    • Leveraged strong communication skills presenting actionable insights to stakeholders for data-informed decision-making.

Skills

Programming and Libraries
Python
PyTorch
SQL
Numpy
Pandas
Data Visualization
Seaborn
Plotly
Matplotlib
Data Science
Statistics
Data Preprocessing
Classification
Regression
Ensemble Methods
Machine Learning
Scikit-Learn
Random Forest
XGBoost
Neural Networks
NLP
Transformers
NER
Sentiment Analysis
Prompt Engineering
Computer Vision
Stable Diffusion
YOLO
CNN
Vision Transformers
ResNet
Knowledge Distillation
MLOps and Cloud
Docker
Kubernetes
MLflow
Vertex AI

Certificates

Awards

Languages

English
Fluent
Hindi
Fluent
Odia
Native Speaker