Projects

A showcase of my data science and AI projects.

Steam Insights (Gaming Market Analysis & Forecasting)

08.2024 - 12.2024

Comprehensive gaming market analysis and forecasting system processing 8M+ data points from 140K+ games. Built an ETL pipeline with Apache Airflow, Databricks Spark, and Kafka. Developed ML models (XGBoost, Random Forest) for review analysis and pricing forecasts. Implemented time series forecasting (ARIMA, Prophet) for genre demand and sales, reaching 85% accuracy on genre demand predictions.

Apache Airflow, Databricks Spark, Kafka, XGBoost, Random Forest, ARIMA, +2 more

FinRAG3 - Agentic Financial Document AI

02.2025 - Present

Six-phase agentic AI system for automated investment due diligence that processes SEC filings (10-K, 10-Q) and fund prospectuses. Features custom parsing algorithms with 95% accuracy, multi-GPU optimization, and an enterprise-grade MLOps pipeline. Reduces financial document analysis time from hours to 5 minutes.

LangGraph, LangChain, ChromaDB, ColBERT v2, FastAPI, Docker, +1 more

HPC Documentation Assistant

02.2025 - Present

RAG-based chatbot that automates High Performance Computing (HPC) documentation support. Features agentic AI for dynamic data scraping, vector database management for HPC queries, and an automated user onboarding system for university research computing infrastructure.

RAG Systems, Vector Databases, FastAPI, Docker, HPC Systems, Agentic AI

Health Analytics Platform

01.2023 - 03.2023

End-to-end health analytics platform using Django and MongoDB for single-source-of-truth (SSOT) data collection. Implemented a RAG approach with LLaMA and FactCC for accurate health summaries. Built an NLP pipeline with spaCy and latent semantic analysis (LSA) for health trend extraction. Achieved a 60% increase in data availability and a 75% reduction in manual errors through adaptive web scraping.

Django, MongoDB, LLaMA, FactCC, RAG Systems, spaCy, +2 more