# AI Product Engineer - ClickStack Observability Platform
## About the Role
Join ClickHouse to build agentic capabilities on top of a petabyte-scale observability platform. You'll focus on developer experience and create AI agents that investigate incidents, surface anomalies, and help answer "why is production broken?"
## What You'll Do
- Build agents that investigate incidents and use ClickStack as their substrate
- Write reusable skills that capture debugging and incident response playbooks
- Own the agent stack end-to-end: context engineering, tool design, evals, tracing, and cost
- Make ClickStack a great platform for AI workloads by building MCP servers, SDKs, and integrations
- Collaborate with open source contributors and customers in the open
- Tackle hard problems: latency, cost, context window limits, eval coverage, hallucinations
## Who You Are
- You have strong opinions about context engineering, tool design, and agent frameworks
- You think in production terms: p99 latency, cost per task, reliability
- You move quickly, ship often, and learn from failures
- You care deeply about developer experience
- You are comfortable with ambiguity and take ownership
## Requirements
- 5+ years of software engineering experience, including 1-2 years on LLM-powered systems or agents in production
- Strong backend skills in TypeScript/Node.js and/or Python
- Hands-on experience building agents: multi-step tool use, planning, memory, error recovery
- Experience designing skills (Markdown-based workflow encodings, Anthropic-style or similar)
- Experience with MCP: building servers, designing tools, thinking through auth and observability
- Strong evals practice: golden sets, LLM-as-judge, regression detection
- SQL proficiency, including writing ClickHouse queries directly
- Comfort with Docker and Kubernetes
- Active in open source and the developer community
## Bonus
- Built or operated production agents in observability, incident response, or SRE
- Strong opinions on agent observability and tracing
- Experience with prompt caching or context compaction techniques
- Experience with columnar databases and event ingestion pipelines
- Contributed to or maintained open source AI/agent projects
- Familiarity with Go, Rust, or other systems languages
## Perks
- Flexible work environment - globally distributed, remote-friendly company
- Healthcare - employer contributions
- Equity in the company - stock options for all new team members
- Flexible time off in the US, generous time-off entitlement elsewhere
- $500 home office setup for remote employees
- Global company gatherings and offsites