
Sushmit Vaish
Passionate Software Engineer with expertise in ML infrastructure, backend systems, and full-stack development. Currently pursuing MS in Software Engineering at Carnegie Mellon University with about 4 years of professional experience.
About Me
I am a passionate Software Engineer with expertise in ML infrastructure, backend systems, and full-stack development. Currently pursuing a Master's degree in Software Engineering at Carnegie Mellon University, I have about 4 years of professional experience spanning Amazon, ZS Associates, and academic research.
At Amazon, I pioneered novel LoRA optimization techniques that reduced GPU costs by 4× and training time by 58%. At ZS Associates, I architected scalable data platforms that generated $270K+ in revenue while reducing operational effort by 70%.
My goal is to leverage my expertise in ML systems, cloud infrastructure, and scalable software architecture to create innovative solutions that drive meaningful impact in the technology industry.
Years Industry Experience
Revenue Generated
Process Optimization
Innovation Awards
Multiple recognition awards for innovative solutions and outstanding performance
Process Excellence
Achieved up to 95% optimization in data processing pipelines
Team Leadership
Successfully led cross-functional teams across multiple high-impact projects
Experience

Software Developer Intern (ML)
Amazon
Reduced LoRA inference GPU cost by 4× by pioneering a post-hoc adapter extraction technique preserving performance
Optimized LoRA training time by 58% by minimizing computations using a novel fine-tuning approach
Automated LoRA-enabled inference benchmarking using LLMPerf, generating key latency metrics (TTFT, TPOT)
Key Technologies & Skills

Research Assistant
Carnegie Mellon University
Created an LLM-based auto-grading system with CodeLlama-7B, achieving 80% alignment with instructor feedback
Applied vector embeddings, prompt engineering and advanced NLP metrics (ROUGE-L, BERTScore) for grading accuracy
Key Technologies & Skills

Teaching Assistant
Carnegie Mellon University
Mentored 20+ students in 'Foundations of Software Engineering' through Code Reviews, Recitations & Agile practices
Facilitated backend project design and promoted modern SDLC methodologies in classroom and labs
Key Technologies & Skills

Software Engineer / Senior Data Analyst
ZS Associates
Improved data extraction performance by 25% via Java multi-threading and Hadoop optimization
Architected a data publish platform in AWS that cut internal effort by 70% and increased client refresh rates 5×
Automated quality checks in data processing pipeline to save daily runtime by 50% using bash scripts
Generated $270K revenue through process optimization initiatives
Successfully led cross-functional teams of 2-3 associates, delivering multiple high-impact projects on time and within scope
Key Technologies & Skills

Software Engineer Intern
Ebix Software India Pvt. Ltd.
Boosted app reliability by writing automated unit tests and embedding them in CI/CD workflows
Resolved critical bugs and collaborated with developers in agile delivery cycles
Quickly learned and adapted to new platforms and software systems
Key Technologies & Skills
Professional Impact
Throughout my career, I have consistently delivered measurable results by combining technical expertise with innovative problem-solving. My experience spans from ML optimization at Amazon to scalable data platforms at ZS Associates, always focusing on creating high-impact solutions that drive business value and technical excellence.
Featured Projects
A showcase of my technical projects demonstrating expertise in full-stack development, data science, and innovative problem-solving.
SportX Platform
CompletedA comprehensive sports networking app with AI-based event recommendations, interactive chat modules, and venue discovery. Features service-oriented architecture with Redis caching and Google Maps integration for enhanced user experience.
Emergency Social Network Application
LiveA dynamic real-time chat application ensuring 24x7 availability and high performance in emergency situations. Enables crisis messaging, location sharing, and support access to medical & relief agencies.
Other Notable Projects
High Performance Image Blurring
Efficient implementation of various blurring techniques using advanced optimization methods including SIMD instructions and custom kernel designs for maximum performance.
Fake News Detection using LSTM
LSTM-based deep learning model for sarcasm detection in news headlines, achieving ~83% accuracy. Leverages Keras with TensorFlow and implements tokenization, padding, and early stopping for sequence optimization.
Technical Skills
A comprehensive overview of my technical expertise and professional capabilities developed through years of hands-on experience and continuous learning.
Programming Languages
Continuous Learning
Current Focus
Advanced ML systems, LLM optimization, and scalable cloud-native architectures
Research Areas
LoRA fine-tuning, GPU optimization, and automated ML pipeline development
Leadership
Teaching assistance, team mentoring, and technical project leadership
Education
Academic foundation and continuous learning journey in computer science and software engineering.
Master of Science in Software Engineering
Carnegie Mellon University
Advanced coursework in software engineering, architecture, distributed systems, and machine learning. Focus on scalable software design and modern development practices.
Key Coursework
Bachelor of Technology in Computer Science
Vellore Institute of Technology
Comprehensive foundation in computer science fundamentals, programming, and software development. Strong emphasis on mathematical foundations and practical applications.
Key Coursework
Academic Excellence
Consistent academic performance with a strong foundation in computer science principles and practical software engineering skills.
Strong Academic Performance
Maintaining a 3.95/4.0 GPA in graduate studies at Carnegie Mellon University
Distinguished Graduate
Graduated with 3.85/4.0 GPA from VIT, demonstrating consistent academic excellence
Research Focus
Published research in data extraction and sentiment analysis using machine learning
Publications
Research contributions in the field of data science and machine learning.
Data Extraction and Sentimental Analysis from "Twitter" using Web Scrapping
Abstract
This research focuses on developing a machine learning model to extract data and identify sentiments with respect to posts and messages on Twitter. The study implements advanced web scraping techniques combined with natural language processing to analyze social media sentiment patterns, providing valuable insights into public opinion dynamics.
Key Contributions
Technologies & Methods
Research Impact & Future Work
This research contributes to the growing field of social media analytics and demonstrates practical applications of machine learning in understanding public sentiment.
Methodology Innovation
Novel approach combining web scraping with advanced ML techniques for social media analysis
Practical Applications
Real-world applications in market research, brand monitoring, and public opinion analysis
Academic Contribution
Published in peer-reviewed journal, contributing to the academic discourse on data science
Achievements & Recognition
Recognition for excellence in innovation, leadership, and technical contributions throughout my professional and academic journey.
Innovator Award
Developed and led a cloud-based web services application project using AWS
Ingenious Award
Optimized data processing pipeline & created an interactive tool using VBA
Rising Star Award
For consistent performance during internship & first cycle as a full-time employee
Interface Hackathon Winner
Won first prize among 45 teams for innovative solution development
HackerTech Hackathon Winner
Won sponsorship from VITTBI (Technology Business Incubator Govt. of India)
KickStartup Hackathon Winner
Won first place among 20 teams for entrepreneurial innovation
Recognition Summary
A track record of consistent excellence and innovation across professional and competitive environments.
Professional Awards
ZS Associates
Hackathon Victories
Out of 85+ teams
Years of Recognition
2017 - 2024
Impact Rate
All achievements