Build and operate a data platform at Microsoft — a contract-driven Spark/Synapse system processing 2+ TB/month of financial data representing $110B in revenue across multiple orgs. Drove the original migration off a legacy U-SQL stack: ~8× E2E speedup at 1/5 the cost.
Proposed and now own the declarative data contract system — the foundation of the platform's configuration-driven approach. Contracts define schemas, validation, and governance and generate pipeline orchestration, so teams ship data behavior without touching platform code. They've become a key surface for cross-team communication and a foundational piece of the platform's AI story. Mandatory reviewer for contract changes; coordinate cross-team rollouts and downstream impact analysis.
Ongoing focus on reliability and DX: faster local Spark workflows, SLA / quarantine / alerting, and self-service onboarding for new entities and teams.
Also helped move an existing big-data platform from the commercial cloud to a secure federal environment.