About
I’m building EvalOps, a system for making AI products more legible, testable, and improvable. I care about the operational side of AI: how teams catch regressions, learn from feedback, and ship behavior changes with evidence.
My background across security, infrastructure, product, and startups shapes how I think about trust, failure modes, and production reality. Before this I co-founded ThreatKey and worked at Vanta, Carta, and Snap.

Experience
Founder & CEO, EvalOps
2025 -
Building evaluation infrastructure to make AI products more legible, testable, and improvable. Agent reliability, regression detection, and evidence-based behavior changes.
Senior Staff Security Engineer, Writer
2025 -
Building Cerebro, an open-source operations data platform for cloud, SaaS, and security posture management. Policy engine, multi-cloud scanning, AI-powered investigation, and compliance automation.
Senior Product Manager, Vanta
2024 - 2025
Joined via ThreatKey. Led security integrations across cloud, code, and infrastructure platforms. Worked on partnerships with Wiz, CrowdStrike, GitHub, GitLab, and others.
Co-founder & CEO, ThreatKey
2020 - 2024
Built a SaaS security posture management platform. Identified misconfigurations and vulnerabilities across cloud infrastructure and business tools (AWS, GCP, Google Workspace, Microsoft 365). Onboarding was self-service: customers could connect integrations and surface security findings in under a minute.
Lead, Security Operations, Carta
2020 - 2021
Built security operations from the ground up. Implemented incident response protocols. Left to go full-time on ThreatKey.
Prior to that, security engineering roles at Lockheed Martin, DoorDash, and Snapchat (2016–2020), and internships from 2013–2016. Been writing code since I should have been playing N64.

Open source & side projects
I run EvalOps, a small research lab focused on agent reliability and evaluation infrastructure.
- Cerebro — operations data platform for cloud and SaaS security (Go, 326K LOC)
- DiffScope — AI code review engine with confidence scoring (Rust)
- cognitive-dissonance-dspy — structured self-critique for LLMs
- dspy-0to1-guide — getting started with DSPy