Launch Week's here! Day 2: Scheduled Evals, read more →

Build the reason AI can be trusted

You'll work alongside people who care deeply about the problem and each other. No ego, no busywork — just hard problems, fast shipping, and a team that has your back.

WHY US

Why Confident AI.

Confident AI is a small, fast-moving team building the infrastructure that makes AI trustworthy. We started by building DeepEval, one of the most used packages for LLM evaluation in the world, used by companies such as OpenAI, Google, and Microsoft.

  • The problem matters. AI is shipping to production faster than anyone can verify it works. We're building the trust layer.
  • Small team, outsized impact. A handful of people used by hundreds of thousands of developers — from solo builders to OpenAI and Google.
  • Speed is the culture. Ideas go from conversation to production in days, not months.
  • Real ownership. You pick up a problem, you own it end-to-end — architecture, implementation, shipping, and the metrics that prove it worked.

If you want to do the best work of your career and actually see it matter, this is the place.

OUR CULTURE

What We Value.

No excuses, no BS
If something is wrong, say it so someone can help. We don't sugarcoat, we don't dance around problems, and we don't let ego get in the way of fixing what's broken. Directness isn't rude here — it's respected.
Ownership
You don't wait to be told. You see the problem, you pick it up, you see it through. You test your own work, catch your own mistakes, and ship things you'd stake your name on. Nobody here is checking behind you — because they shouldn't have to.
First principles thinking
We don't do things because that's how they're done. Every decision gets pressure-tested. If the best answer is uncomfortable or unfamiliar, good — that's usually the right direction.
Customer obsession
We exist to solve our customers' problems. We talk to them directly, we respond fast, and we never leave them guessing or ghosted. If a customer has a problem, it's our problem — and they'll always know where they stand with us.
Radical transparency
Hiding a problem won't make it go away. We surface issues early, share context openly, and trust each other with the full picture. No politics, no back-channels — just the truth, delivered with respect.
Never stop sharpening
Nobody here will nag you to get better. We hire people who are already wired that way — who read, ask questions, seek feedback, and come back sharper every week. Growth here isn't a performance review conversation. It's just how you operate.
OPEN POSITIONS

Join our team.

Developer Relations

Founding Developer Advocate

San Francisco$130K–$175K base + equityDeveloper Relations

Overview

Confident AI is building the infrastructure that makes AI trustworthy. We created DeepEval, the open-source evaluation framework, and we're building the commercial platform that engineering teams use to ship reliable AI products. We have strong product-market fit, a developer community that's growing fast, and teams actively choosing us.

We're looking for a Founding Developer Advocate to own the developer experience from first touch to activation — across both our open-source framework and our commercial platform. You'll create the content, build the community, and represent us at events alongside the founding team.

This is a founding role with a seat at the table. You'll have full freedom to decide what to build, what to say, and how to say it. You won't be executing someone else's content calendar — you'll define the strategy and own the results. When developers tell you something about the product isn't working, you're in the room changing the roadmap, not filing a ticket.

What you'll be doing

  • Own developer content strategy and execution across both DeepEval (open-source) and the Confident AI platform (commercial product). These are distinct products with different audiences and different adoption paths — you'll understand both and create content that serves each.
  • Create onboarding content, demo videos, tutorials, and technical walkthroughs that help developers get value from the product fast.
  • Build and grow our developer community. Be present in the forums, Discord channels, GitHub discussions, and social platforms where our users spend time. Engage with them as a peer, not a marketer.
  • Represent Confident AI at developer events, meetups, and conferences alongside the founders. We're all out there building relationships and talking to developers — you'll be a key part of that.
  • Write technical blog posts, thought leadership, and sharp content that positions us as the authority in AI evaluation and testing infrastructure. Real insight, not recycled takes.
  • Be the voice of the developer internally. You'll have direct influence on product decisions based on what you're hearing from the community.
  • Own competitive positioning in developer conversations. Make sure we show up in every discussion where engineering teams are evaluating AI infrastructure solutions.
  • Coordinate with the founding team on product launches across both open-source and commercial products.

You should be someone who

  • 3+ years of experience in developer relations or developer advocacy at a developer tools, open-source, or infrastructure company. This is non-negotiable — the autonomy we're offering requires that you've done this before and done it well.
  • Has an existing network in the developer tools and AI community. You know people, and people know you. When you vouch for a product, it carries weight.
  • Understands the difference between open-source community building and commercial product marketing, and can navigate both authentically.
  • Can actually write — clear, sharp technical content that developers respect, not marketing copy they scroll past.
  • Comfortable on camera and on stage. You'll be producing video content and speaking at events regularly — this isn't optional.
  • Proficient with AI tools like Claude Code and Cursor as part of your daily workflow.
  • Technical enough to understand the product deeply and speak credibly to engineering teams about AI evaluation, testing, and observability.
  • Self-directed and high-agency. You don't wait to be told what to do — you identify what matters, make a plan, and ship.
  • Comfortable with ambiguity and fast iteration. We're a seed-stage startup; the playbook doesn't exist yet.

Your work will

  • Be the reason developers go from signing up to becoming active, engaged users of both DeepEval and the Confident AI platform.
  • Shape how the developer community perceives us and the category we're defining.
  • Directly influence product direction based on what you're hearing from developers every day.
  • Build the community and content engine that scales with the company from seed to market leader.

By joining us, you will

  • Full autonomy: You own the strategy. We're not hiring you to follow a playbook — we're hiring you to write it.
  • A seat at the table: Direct access to the founding team, influence on product decisions, and a voice in company strategy.
  • The problem: You'll work on the problem that makes all other AI work trustworthy. The impact ceiling here is massive.
Marketing

Founding Head of Marketing

San Francisco$130K–$180K base + equityMarketing

Overview

Confident AI is building the infrastructure that makes AI trustworthy. We have strong product-market fit, a developer community that loves what we've built, and engineering teams actively choosing us. Now we need someone to turn that momentum into a brand the entire AI infrastructure industry recognizes.

We're looking for a Founding Head of Marketing to own marketing end-to-end — brand positioning, developer marketing, content strategy, SEO, GEO, distribution, and go-to-market — as our first dedicated marketing hire. You'll work directly with the founders at a stage where your decisions shape the company, the brand, and the category we're defining.

This is not a role where you inherit a playbook. You'll build the marketing function from scratch, define what growth looks like, and create the distribution engine behind one of the fastest-growing companies in AI infrastructure. You'll have full freedom to set the strategy and run with it.

What you'll be doing

  • Own brand positioning, messaging, and narrative for the company and our products. You'll define how the market perceives us and the category we're creating.
  • Build and scale developer marketing distribution channels — SEO, GEO, technical content marketing, partnerships, developer community, and events — and figure out which levers drive adoption at each stage of growth.
  • Develop go-to-market strategy for our commercial platform alongside the founders, turning open-source developer adoption into commercial revenue.
  • Write and publish content that establishes us as the authority in AI infrastructure and developer tooling. Thought leadership and sharp technical storytelling, not documentation and tutorials.
  • Own competitive positioning and make sure we show up in every conversation where engineering teams are evaluating AI infrastructure solutions — including in AI-generated recommendations and search results.
  • Define growth marketing metrics, measure them honestly, and find new channels before the old ones plateau.
  • Represent the company at developer events, meetups, and conferences alongside the founders. You'll be a visible presence in the communities where our users spend time.
  • Create demo videos, tutorials, and technical walkthroughs that help developers discover and adopt the product. You should be comfortable putting your face on content and shipping it consistently.
  • Coordinate product launches across engineering, bringing structure to how we ship, announce, and communicate publicly.

You should be someone who

  • 5+ years in marketing at a developer tools, open-source, or infrastructure company.
  • Understands how open-source developer adoption becomes commercial revenue — and you've been part of making that happen.
  • Has built a marketing function or growth channel from scratch, not just operated within an existing one.
  • Hands-on experience across multiple marketing disciplines: content strategy, product positioning, developer campaigns, and distribution.
  • Can actually write — clear, sharp copy that technical audiences respect, not marketing fluff they ignore.
  • Comfortable translating complex technical concepts into narratives that resonate with engineering leaders, developer advocates, and individual developers alike.
  • Comfortable on camera and on stage. You'll be producing video content and representing us at events — this isn't optional.
  • Proficient with AI tools like Claude Code and Cursor as part of your daily workflow.
  • Self-directed and high-agency. You don't wait to be told what to do — you identify what matters, make a plan, and execute.
  • Comfortable with ambiguity and fast iteration. We're a seed-stage startup; the playbook doesn't exist yet.

Your work will

  • Shape how the market thinks about us and the category we're defining.
  • Be the reason engineering teams choose us over the alternatives.
  • Build the brand and distribution engine that scales with the company from seed to market leader.

By joining us, you will

  • Full autonomy: You own the strategy. We're not hiring you to follow a playbook — we're hiring you to write it.
  • A seat at the table: Direct access to the founding team at the stage where every decision compounds.
  • The problem: You'll work on the problem that makes all other AI work trustworthy. The impact ceiling here is massive.
Engineering

Founding Infrastructure Engineer

San Francisco or Remote (EU)$100K–$200K base + equityEngineering

Overview

Confident AI is building the infrastructure that makes AI trustworthy. We're an observability platform used by engineering teams who need to understand what their AI systems are actually doing. We have strong product-market fit and a growing base of customers deploying in production.

We're looking for a Founding Infrastructure Engineer to own the reliability, scalability, and infrastructure that our platform runs on. Today we operate in the cloud. Soon, our largest customers will deploy us on-prem in environments we don't control — high-traffic, high-ingest workloads where we can't just hotfix in production. You'll be the person who makes sure we survive that transition and thrive on the other side.

This is a foundational hire. You'll design the systems that let an observability platform handle massive data volumes without breaking, build the deployment story for on-prem and hybrid environments, and set the infrastructure standards that the rest of engineering builds on. If you want to own the hardest scaling and reliability problems at an early-stage company, this is the role.

What you'll be doing

  • Own the reliability and scalability of our platform. You'll design and operate infrastructure that handles high-throughput data ingestion at scale — the kind of load that observability platforms generate.
  • Build our on-prem and hybrid deployment story from scratch. This means packaging the entire service stack — ClickHouse, Postgres, Redis, application services — for customer environments, plus configuration management, upgrade paths, and operational runbooks for deployments where you have limited visibility and no ability to push quick fixes.
  • Design and implement the observability, monitoring, and alerting for our own infrastructure — yes, the observability platform needs to observe itself.
  • Own our Kubernetes infrastructure, CI/CD pipelines, and cloud-native architecture across AWS/GCP. Make sure engineers can ship fast without breaking things.
  • Architect our data layer for scale. Our stack runs on ClickHouse for high-volume analytical ingestion, Postgres for transactional data, and Redis for caching and real-time workloads. You'll make sure none of these become the bottleneck as traffic grows by orders of magnitude.
  • Establish infrastructure-as-code practices, deployment automation, and incident response processes that let a small team operate with the reliability of a much larger one.

You should be someone who

  • 5+ years in platform engineering, infrastructure, or SRE roles — ideally at a company where uptime and data throughput were existential.
  • Deep experience with Kubernetes, container orchestration, and cloud-native infrastructure (AWS and/or GCP).
  • Strong experience with ClickHouse, Postgres, or similar — performance tuning, replication, schema design at scale, not just writing queries. Redis experience is a plus.
  • You've built or significantly contributed to on-prem or hybrid deployment systems — packaging, shipping, and supporting a multi-service stack in environments you don't control.
  • You think in systems, not just services. You understand how failures cascade and you design to prevent that.
  • Experience with high-throughput data pipelines — ingestion, processing, and storage at volumes where naive approaches break.
  • Comfortable with infrastructure-as-code (Terraform, Pulumi, or similar) and CI/CD automation.
  • You've been on-call and you've built the systems that make on-call less painful.
  • Self-directed and high-agency. You identify what's about to break before it does, and you fix it without being asked.
  • Comfortable with ambiguity and fast iteration. We're a seed-stage startup; you'll be building the foundation, not maintaining someone else's.
  • You use AI tools — LLMs, copilots, automation — to move faster. At our size, every engineer needs to operate at a multiplied level.

Your work will

  • Be the reason our platform stays up when a Fortune 500 deploys us on-prem with 100x the traffic we've seen before.
  • Define the infrastructure and deployment architecture that scales with the company from seed to market leader.
  • Let the rest of the engineering team ship product fast because the platform underneath them is solid.
Engineering & Growth

Founding Open-Source Growth Engineer

US or Remote$100-200k USD (+equity)Engineering & Growth

What you'll be doing

  • Build features for DeepEval across LLM evaluation and red teaming.
  • Write documentation and blog posts that the open-source community actually wants to read.
  • Distribute content across Reddit, Twitter, LinkedIn, and developer communities.
  • Own our Discord and GitHub community — answer questions, triage issues, build relationships.
  • Define what growth means for each channel, measure it, and find new distribution levers.
  • Form partnerships and integrations with other open-source projects.

You should be someone who

  • Codes proficiently in Python and TypeScript — this is an engineering role, not just a marketing role.
  • Writes well and enjoys it. Documentation, blog posts, community replies — you care about how things read.
  • Has a green GitHub profile and is already active in open-source.
  • Picks things up fast. You'll learn SEO, GEO, and growth strategies on the job.
  • Has genuine curiosity — you read papers, explore new tools, and stay close to what's happening in AI.
  • Communicates clearly and directly.
  • Willing to work 6 days a week in a high-intensity startup.

Your work will

  • Be used by hundreds of thousands of developers, from individual builders to teams at OpenAI and Google.
  • Educate thousands of people on how to properly evaluate their LLM applications.
  • Help grow DeepEval into the standard for AI evaluation.
HIRING PROCESS

Our Hiring Process.

The entire process is usually fully remote and all communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so are respectful of your time and will get back no later than 2 days of each step along the process.

The entire process has 4 steps and takes around 1.5 weeks in total:

  1. Initial 15-30 minute phone screening interview.
  2. One 30-45 minute technical interview.
  3. One week fully-paid work trial.
  4. Full-time offer.

No hires will be made without a work trial. You'll be working with the founders directly throughout the entire process. For any questions, email [email protected].

Interested? Let's talk.