Projects
A showcase of my data science and AI projects.
Steam Insights (Gaming Market Analysis & Forecasting)
08.2024 - 12.2024
Comprehensive gaming market analysis and forecasting system processing 8M+ data points from 140K+ games. Built ETL pipeline with Apache Airflow, Databricks Spark, and Kafka. Developed ML models (XGBoost, Random Forest) for review analysis and pricing forecasts. Implemented time series forecasting (ARIMA, Prophet) achieving 85% accuracy in genre demand predictions and reliable sales forecasting.
FinRAG3 - Agentic Financial Document AI
02.2025 - Present
6-phase agentic AI system for automated investment due diligence processing SEC filings (10-K, 10-Q) and fund prospectuses. Features custom parsing algorithms with 95% accuracy, multi-GPU optimization, and enterprise-grade MLOps pipeline. Reduces financial document analysis time from hours to 5 minutes.
HPC Documentation Assistant
02.2025 - Present
RAG-based chatbot for High Performance Computing documentation automation. Features agentic AI for dynamic data scraping, vector database management for HPC queries, and automated user onboarding system for university research computing infrastructure.
Health Analytics Platform
01.2023 - 03.2023
End-to-end health analytics platform using Django and MongoDB for single-source-of-truth (SSOT) data collection. Implemented RAG approach with LLaMA and FactCC for accurate health summaries. Built NLP pipeline with spaCy and LSA for health trend extraction. Achieved 60% increase in data availability and 75% reduction in manual errors through adaptive web scraping.