I build ML platform infrastructure: the layer between raw data and production models.
Currently at YipitData, building the data platform behind Signals: web-scale extraction pipelines, a knowledge base for RAG, semantic search, and model serving benchmarked across open-source and proprietary models.
Previously at CRED on the central data platform team — real-time pipelines at 500M+ events/day (Kafka, Kinesis, Flink, Databricks).
What I'm currently focused on:
- Feature stores & real-time feature computation
- Low-latency model serving infrastructure
- Building ML applications for real-world problems
Stack: Python · PySpark · Kafka · Flink · Redis · MLflow · FastAPI · Databricks · AWS (Kinesis, Lambda, ECS) · Docker
🤝 Reach me: [email protected] · LinkedIn
IIT Hyderabad · B.Tech Engineering Science · 2019