Software Engineer · Data · Chicago

Building data infrastructure that scales.

I'm Chaitanya, a data engineer at Egen. I design ETL pipelines and cloud-based systems that turn messy data into reliable, production-ready products. Maintainer of cloudfit and a few other open-source Python packages.

Chaitanya Kasaraneni
5+ yrs in data engineering · 6+ peer-reviewed papers · 4+ OSS Python packages on PyPI · GCP Professional Data Engineer · MS Comp Eng, SJSU

Currently building

Open source

All work →
  • 2026

    cloudfit

    Cloud-agnostic machine type recommender for batch and bioinformatics workloads. Multi-package OSS ecosystem (scoring engine, GCP provider, FastAPI service) with a multi-region bundled snapshot and a live demo on Hugging Face Spaces.

    PyPI · FastAPI · Multi-cloud · Live demo ↗
  • 2026

    samplesheet-parser

    Format-agnostic parser for Illumina SampleSheet.csv files. Auto-detects IEM v1 vs. BCLConvert v2, validates index integrity with Hamming distance checks, and converts or merges sheets across mixed sequencing fleets.

    PyPI · Bioconda · Bioinformatics · Apache 2.0
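
For a rough idea of what a Hamming distance check on sample indexes looks like, here is a minimal standalone sketch. It is not samplesheet-parser's actual API: the function names, the example sample IDs and index sequences, and the `min_distance=3` threshold are all assumptions made for illustration.

```python
from itertools import combinations


def hamming_distance(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length index sequences."""
    if len(a) != len(b):
        raise ValueError("index sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def find_index_collisions(
    indexes: dict[str, str], min_distance: int = 3
) -> list[tuple[str, str, int]]:
    """Return (sample_a, sample_b, distance) for every pair of samples whose
    indexes differ in fewer than min_distance positions.

    min_distance=3 is an illustrative default, not a value taken from
    samplesheet-parser.
    """
    collisions = []
    for (name_a, seq_a), (name_b, seq_b) in combinations(indexes.items(), 2):
        distance = hamming_distance(seq_a, seq_b)
        if distance < min_distance:
            collisions.append((name_a, name_b, distance))
    return collisions


if __name__ == "__main__":
    # Hypothetical sample IDs and index sequences.
    sheet = {"S1": "ATTACTCG", "S2": "ATTACTCA", "S3": "TCCGGAGA"}
    for sample_a, sample_b, distance in find_index_collisions(sheet):
        print(f"{sample_a} and {sample_b} differ in only {distance} position(s)")
```

Pairs that sit too close together are the ones a single sequencing error could swap during demultiplexing, which is why a check like this is worth running before a sheet reaches the instrument.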

Selected

Recent research

All publications →

Selected

Writing

All articles →

Want to talk shop?

Always happy to chat about ETL, ML in production, cloud architecture, or research. Drop me a note.