About Me

Welcome to my portfolio! I am Sanjai Bala (Sanjaikumar Balasubramaniyan), a passionate AI and Machine Learning engineer with a strong foundation in applied machine learning, data engineering, and analytics. With hands-on experience across research and industry, I have built impactful solutions including LLM-powered analytics tools, real-time dashboards, scalable data pipelines, and predictive modeling systems that improve decision-making. Currently pursuing my Master’s in Data Science at Indiana University Bloomington, I focus on developing intelligent systems that transform complex data into actionable insights and real-world applications.


My projects reflect a strong commitment to solving practical problems using AI and data-driven technology. From developing AI workflows that improved research and analytics efficiency by up to 40% to building an Automatic Number Plate Recognition (ANPR) system with 96% accuracy (published in an international journal), my work combines innovation with reliability. Skilled in Python, JavaScript, SQL, PyTorch, and cloud platforms such as AWS, I am dedicated to building scalable, user-focused AI solutions that create measurable impact. Explore my portfolio to see how I turn data into decisions.

  • Machine Learning
  • Statistics
  • Data Analytics
  • Generative AI
  • NLP
  • Deep Learning
  • Data Cleaning
  • Data Wrangling
  • ETL Pipelines
  • AWS (S3, EC2, SageMaker)
  • GCP (BigQuery, Dataflow)
  • Apache Spark
  • Python
  • SQL
  • R
  • Jan 2025 - Present
    Research Data Scientist | Indiana University Bloomington.
  • July 2025 - Nov 2025
    AI Engineer Intern | riAI Capital LLC.
  • Apr 2023 - July 2024
    Data Scientist | Systech Solutions Inc.
  • May 2022 - Jan 2023
    Data Analyst Intern | 8Queens Software Technologies Pvt Ltd.
  • Aug 2024 - May 2026
    Master's in Data Science | Indiana University Bloomington.
  • Oct 2020 - Apr 2024
    Bachelor's in Artificial Intelligence & Data Science | SRMIST University.

My Experience

AI Engineer Intern

riAI Capital LLC, Reno, NV

July 2025 – Nov 2025


Conducted foundational research on AI-driven solutions for financial advisory, asset management, and ETF analytics, contributing to the company’s long-term AI platform strategy.

Designed and optimized large language model (LLM) workflows and prompt templates for portfolio optimization and investment research, improving analytical efficiency and decision-making accuracy.

Defined performance metrics and evaluation frameworks to benchmark early-stage AI tools, enabling leadership to track impact, scalability, and adoption across teams.

Research Data Scientist

Indiana University, Bloomington

Jan 2025 - Present


Mapped funding networks for 157,000+ BIPOC nonprofit grants to analyze funding trends and relationships. Overcame data limitations with advanced retrieval techniques for comprehensive analysis.

Conducted network analysis to identify bonding, bridging, and linking patterns in grant funding. Classified grant recipients and funders by demographic focus, uncovering insights into funding equity for underserved communities.

Data Scientist Intern

Systech Solutions Inc, India

Apr 2024 - July 2024


Built real-time Power BI dashboards, enhancing decision-making speed and visualization quality by 40%. Automated data workflows with optimized ETL pipelines, cutting data processing time by 35%.

Deployed machine learning models, boosting customer segmentation accuracy by 25% for predictive insights. Engineered scalable data solutions, enhancing cross-team operational efficiency by 30%.

Data Analyst Intern

8Queens Software Technologies, India

May 2022 - Jan 2023


Streamlined real-time data processing by transforming datasets, improving accessibility and accuracy. Created interactive Tableau dashboards for sales, driving a 15% improvement in performance insights.

Deployed a supervised ML model for diabetes risk prediction, boosting predictive accuracy in health analytics. Delivered actionable insights through data-driven solutions, supporting strategic decision-making.

My Projects

ROS2 Object Tracking and Control System

Built a real-time ROS2-based object tracking and control system integrating computer vision, state estimation, and closed-loop robotic control.

Implemented object detection and tracking using OpenCV, stabilized motion estimates using a Kalman Filter, and generated smooth velocity commands via PID control.

Designed a modular ROS2 pipeline with camera, perception, tracking, and control nodes, ensuring robust real-time performance under noisy and intermittent detections.
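The filtering and control steps above can be illustrated with a minimal, ROS-free Python sketch. This is not the project's code: the one-dimensional random-walk model, the noise variances, the PID gains, and the names `ScalarKalman`, `PID`, and `track` are all assumptions chosen for illustration. It shows how a Kalman filter can coast through a missed detection (`None`) while a PID controller turns the smoothed offset into a bounded velocity command.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ScalarKalman:
    """1-D random-walk Kalman filter for one image coordinate (assumed model)."""
    x: float = 0.0   # state estimate, e.g. target offset from image center in pixels
    p: float = 1.0   # variance of the estimate
    q: float = 0.01  # process noise variance (assumed value)
    r: float = 4.0   # measurement noise variance (assumed value)

    def step(self, z: Optional[float]) -> float:
        # Predict: the state is assumed constant, so only the uncertainty grows.
        self.p += self.q
        # Update only when a detection arrived; otherwise keep the prediction.
        if z is not None:
            k = self.p / (self.p + self.r)  # Kalman gain
            self.x += k * (z - self.x)
            self.p *= 1.0 - k
        return self.x


@dataclass
class PID:
    """Textbook PID controller with output clamping."""
    kp: float
    ki: float
    kd: float
    limit: float = 1.0
    _integral: float = 0.0
    _prev_error: Optional[float] = None

    def step(self, error: float, dt: float) -> float:
        self._integral += error * dt
        if self._prev_error is None:
            derivative = 0.0  # no derivative on the first sample
        else:
            derivative = (error - self._prev_error) / dt
        self._prev_error = error
        u = self.kp * error + self.ki * self._integral + self.kd * derivative
        return max(-self.limit, min(self.limit, u))  # clamp the command


def track(measurements: List[Optional[float]], dt: float = 0.05) -> List[float]:
    """Turn noisy, possibly missing offsets into velocity commands."""
    kf = ScalarKalman()
    pid = PID(kp=0.02, ki=0.0, kd=0.001)  # illustrative gains, not tuned values
    commands = []
    for z in measurements:
        offset = kf.step(z)
        commands.append(pid.step(-offset, dt))  # drive the offset toward zero
    return commands


if __name__ == "__main__":
    print(track([10.0, None, 12.0, 11.0]))
```

In a ROS2 setting, each class would typically live in its own node, with the tracker publishing the filtered offset and the controller subscribing to it; that wiring is omitted here.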

Predictive Maintenance – RUL Estimation (LSTM)

Developed a deep learning–based predictive maintenance system to estimate Remaining Useful Life (RUL) of industrial machinery using multivariate sensor time-series data.

Designed an end-to-end pipeline with data normalization, sliding-window sequence generation, and LSTM-based regression to capture long-term degradation trends.

Evaluated models using RMSE and MAE, demonstrating improved failure prediction accuracy over traditional threshold-based maintenance approaches.
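The preprocessing and evaluation steps above can be sketched in plain Python. This is an illustrative outline rather than the project's pipeline: the helper names (`min_max_normalize`, `make_windows`, `rmse`, `mae`) and the choice to label each window with the RUL at its final time step are assumptions, and the LSTM itself is left out.

```python
import math
from typing import List, Sequence, Tuple


def min_max_normalize(series: Sequence[float]) -> List[float]:
    """Scale one sensor channel to [0, 1]; a constant channel maps to 0.0."""
    lo, hi = min(series), max(series)
    span = hi - lo
    return [0.0 if span == 0 else (v - lo) / span for v in series]


def make_windows(
    features: Sequence[Sequence[float]],
    rul: Sequence[float],
    window: int,
) -> Tuple[List[List[List[float]]], List[float]]:
    """Pair each run of `window` consecutive readings with the RUL at its last step."""
    xs, ys = [], []
    for end in range(window, len(features) + 1):
        xs.append([list(row) for row in features[end - window:end]])
        ys.append(rul[end - 1])
    return xs, ys


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared error; penalizes large misses more heavily."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error, in the same units as the RUL target."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


if __name__ == "__main__":
    # Four time steps, two sensors each; RUL counts down to failure.
    readings = [[0.1, 0.2], [0.2, 0.3], [0.3, 0.4], [0.4, 0.5]]
    xs, ys = make_windows(readings, rul=[3, 2, 1, 0], window=3)
    print(len(xs), ys)
```

Each element of `xs` has shape (window, sensors), which is the sequence layout an LSTM regressor usually expects. In practice the normalization bounds would be computed on training data only and reused at inference time.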

Industrial Surface Defect Detection (Computer Vision)

Built a deep learning–based computer vision system to automatically detect industrial surface defects such as cracks, scratches, and dents from high-resolution images.

Implemented CNN and transfer-learning pipelines using PyTorch, with extensive data augmentation and class-imbalance handling for robustness under varying lighting and surface conditions.

Optimized performance using precision, recall, and F1-score to minimize false negatives, demonstrating applicability to real-world industrial inspection workflows.
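For reference, the three metrics named above can be computed from binary predictions as in the short sketch below. The function name `precision_recall_f1` and the convention that label 1 means "defect" are assumptions for illustration, not details taken from the project.

```python
from typing import Sequence, Tuple


def precision_recall_f1(
    y_true: Sequence[int],
    y_pred: Sequence[int],
    positive: int = 1,
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # missed defects

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # 1 = defect, 0 = clean surface (assumed labeling)
    print(precision_recall_f1([1, 1, 1, 0, 0], [1, 1, 0, 1, 0]))
```

Recall is the metric most directly tied to false negatives: every missed defect lowers it, which is why inspection workflows often accept some loss of precision to keep recall high.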

AI Agent for Operating Backend System

Built an AI agent using the Gemini Pro LLM to automate backend workflows, improving efficiency by 50% and cutting manual tasks by 80% at 95% accuracy.

Developed a Flask API with SQLite, cutting response times by 40%, supporting 1,000 queries, and boosting backend efficiency by 30%.

CardioCare

CardioCare is a heart disease management system that leverages machine learning to predict and assess cardiovascular risks based on real-time patient data.

The platform provides healthcare professionals with valuable insights, enabling proactive and informed decision-making for better patient care.

Click the link below for the live website.

Air Calligraphy Using Computer Vision

Designed a computer vision-based system that lets individuals with hand or limb impairments write in the air by tracking a fingertip, eliminating the need for traditional input devices.

Leveraged OpenCV and Python to implement real-time object detection and gesture recognition for intuitive and accessible air-writing.

NYC Crime Analytics - using Preswald

The NY Crime Analytics Application is an interactive dashboard built using Preswald, providing a comprehensive view of crime trends across New York State from 1990 to 2023.

The application allows users to explore county-wise crime statistics, including violent, property, and firearm-related crimes, with visually engaging charts and interactive filtering for deeper insights.

Monument Intelligence Dashboard

The Monument Intelligence Dashboard is an interactive, AI-powered web app built with Preswald, allowing users to explore and analyze iconic monuments worldwide through dynamic visualizations and natural language queries.

The application features an interactive geospatial map, trend analysis by country and century, and a chatbot for intuitive, conversational exploration of global monuments' historical data and visitor statistics.

BlinkIt Sales Analytics

Developed an interactive Power BI dashboard to analyze Blinkit's sales data, providing insights into key metrics such as total sales, item distribution, outlet performance, and customer preferences.

Implemented dynamic filtering and visualizations to track sales trends by product type, outlet size, and location, enabling data-driven decision-making for business growth.

My Products

Jobha Naturals Web Application

Managed a cross-functional team of six developers and coordinated with stakeholders to design, develop, and launch a scalable e-commerce platform for Jobha Naturals. Oversaw project timelines, feature prioritization, and seamless integration of Razorpay for secure transactions.

Maintain ongoing collaboration with business stakeholders to optimize performance, implement updates, and provide strategic technical support.

Box of Wellness Web Application

A static web application that serves as a catalog of nutritious foods tailored for gym-goers, highlighting key nutritional facts. Users can conveniently browse healthy meal options and place orders, promoting informed dietary choices.

Clear layout for easy navigation, ensuring users quickly find the right meals to meet their fitness goals.

App

Coming Soon

My Research

Snacks & Satiety Research Study

Worked as a Research Assistant under Dr. Bret Rust on the Snacks & Satiety Study, funded by Linus Technology. Contributed to data analysis and research on the effects of snack consumption on satiety.

Contributed to an upcoming data authorship piece in The New York Times Magazine, showcasing expertise in data-driven insights and research.

Funding Networks for BIPOC Non-Profit Organization

Conducted network analysis to identify bonding, bridging, and linking patterns in grant funding, classifying recipients and funders by demographic focus to uncover insights into funding equity for underserved communities.

Presented the research at the ARNOVA conference, showcasing findings on the funding dynamics and equity for BIPOC nonprofit organizations.
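To make the bonding, bridging, and linking distinction concrete, here is a small rule-based Python sketch. It is a hypothetical illustration, not the method used in the study: the fields `focus` and `tier`, the function names, and the rules themselves (different tier means linking, same tier and same focus means bonding, otherwise bridging) are assumptions adapted from the usual social-capital definitions.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Org = Dict[str, str]  # hypothetical record: {"focus": ..., "tier": ...}


def classify_tie(funder: Org, recipient: Org) -> str:
    """Label one funder-to-recipient grant as bonding, bridging, or linking."""
    if funder["tier"] != recipient["tier"]:
        return "linking"   # tie across levels of scale or authority
    if funder["focus"] == recipient["focus"]:
        return "bonding"   # tie within the same demographic focus
    return "bridging"      # tie across different demographic focuses


def summarize(grants: Iterable[Tuple[Org, Org]]) -> Counter:
    """Count how many grants fall into each tie type."""
    return Counter(classify_tie(funder, recipient) for funder, recipient in grants)


if __name__ == "__main__":
    community_a = {"focus": "A", "tier": "community"}
    community_b = {"focus": "B", "tier": "community"}
    national_a = {"focus": "A", "tier": "national"}
    grants = [
        (community_a, community_a),
        (community_a, community_b),
        (national_a, community_a),
    ]
    print(summarize(grants))
```

Counting tie types per funder or per community in this way gives a simple starting point for comparing how funding flows within and across groups.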

Advanced ANPR Approach for Vehicle Management

Published a research paper titled "Advanced ANPR for Vehicle Management" in Adalya International Journal (DOI: 10.37896/aj13.3/001), which focused on using YOLO-based object detection combined with OCR for real-time license plate recognition, achieving 96% accuracy and improving blurred image clarity by 85%.

Developed a scalable, autonomous vehicle tracking system that utilized real-time object detection, motion tracking, and decision-based logging to optimize parking flow and entry/exit management in vehicle management systems.

AI-Powered Banknote Identification System for Visually Impaired

Presented a paper titled "AI-Powered Banknote Identification System for Visually Impaired" at the Recent Trends in Analytics and Computing Technologies (RTACT) 2024 conference, showcasing an AI-based solution for assisting visually impaired individuals in identifying banknotes.

The system utilizes advanced computer vision and machine learning techniques to accurately identify banknotes and provide audio feedback, enhancing accessibility for visually impaired users in financial transactions.

Air Calligraphy using Computer Vision

Coming Soon

Contact Me

sanjaibala11@gmail.com

+1 (812) 671-6737
