Careers

We're on a mission to dictate the future of AI testing

Join us to build and grow the world's biggest and most loved open-source LLM evaluation product.

What is Confident AI?

Confident AI an open-source company building 1) an open-source package called DeepEval to unit-test LLM applications such as chatbots, agents, and RAG pipelines, and 2) the cloud platform for DeepEval. It's like Next.JS and Vercel. The founding team is a small group of exceptional engineers and researchers from top colleges and companies such as Google, Microsoft, and Princeton.

Our Values and Morals

Things we value:

  • No excuses or BS—if something is wrong, surface it so someone can help.
  • Openness and transparency—hiding a problem won’t make it go away.
  • No politics, micromanagement, or bureaucracy, even in controversial discussions.
  • Autonomy, ownership, and responsibility—just as expected from any grown adult.
  • No ghosting—respect others’ time and effort.
  • Doers, not yappers, function over form. This means we're ok with remote work as long as you deliver.

If this sounds like you, we'd love to talk to you. Please find available job openings down below.

Open Positions

Founding Open-Source (Research) Engineer
US or Remote
$100,000-200,000k USD (+lots of equity)
Engineering & Research

What you'll be doing:
- Working on DeepEval (most used package for LLM evaluation in the world) for both LLM evaluation features and also LLM red teaming features.
- Incorporating the latest research in the features and metrics to our offering and constantly updating it as needed.
- Write content around what you've built in the form of documentation and blog articles for the open-source community.
- Support our open-source community for any questions and help they might need.

You should be able to:
- Read papers, and have a natural curiosity for new research.
- Write clearly, and is an avid reader.
- Code proficiently and quickly in Python and Typescript.
- Work 6 days a week, we're not hiding we expect a lot from you.

Your work will:
- Be used by hundreds of thousands of open-source users, all the way from individual hobbyist to AI leaders at Fortune 500 companies.
- Educate hundreds of thousands of people, that wouldn't otherwise know how to quality assure their LLM applications.
- Be respected and appreciated by the community.

By joining us, you will:
- Be shaping the future of LLM testing and evaluation.
- Learn how to run and do startups, in a relatively safe environment.
- Work closely with the founders, with the possibly of promoted to an executive role in the future.
- Be compensated highly, with generous founding equity. This also means that we expect a lot from you.

Apply by emailing your resume to hiring@confident-ai.com (Titled "Interested in OSRE")
Founding Fullstack (Infrastructure) Engineer
US or Remote
$100,000-200,000k USD (+lots of equity)
Engineering

What you'll be doing:
- Working on Confident AI, the DeepEval cloud platform.
- Scale Confident AI's backend infrastructure to process millions of evaluations a month.
- Deploying Confident AI on-premises for enterprises.
- Support our closed-source customers and help them with anything they might need.
- Occasionally, write interesting content around how you're scaling Confident AI's systems for the developer community.

You should be able to:
- Write SQL, and be an expert in scaling relational database systems (PostgresQL).
- Dockerize distributed, and have experience working with the AWS services such as EKS.
- Conduct on-premise deployments in our customers' cloud providers such as AWS, Azure, and GCP.
- Work with multi-tenant (authentication) systems.
- Follow best data practices to ensure we remain SOCII and HIPAA complied.
- Code proficiently and quickly in Python and Typescript.
- Work 6 days a week, we're not hiding we expect a lot from you.

Your work will:
- Be used by hundreds of engineering teams, all the way from individual developers to Fortune 500 companies.
- Enable hundreds of engineering teams to gain instantly visibility into the performance of their LLM applications that wouldn't otherwise be possible.
- Make DeepEval even more popular (counter-intuitively).
- Be respected and appreciated by our customers.

By joining us, you will:
- Bring LLM testing and evaluation to the largest companies available.
- Learn how to serve enterprise customers as a startup, in a relatively safe environment.
- Work closely with the founders, with the possibly of promoted to an executive role in the future.
- Be compensated highly, with generous founding equity. This also means that we expect a lot from you.

Apply by emailing your resume to hiring@confident-ai.com (Titled "Interested in FIE")

Our Hiring Process

The entire process is usually fully remote and all communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so am respectful of your time and will get back no later than 2 days of each step along the process.

The entire process has 4 steps and takes around 1.5 week in total:

  • Initial 15-30 minute phone screening interview.
  • One 30-45 minute technical interview.
  • One week fully-paid work trial.
  • Full-time offer.

You'll be working with the founders directly throughout the entire process. For any questions, email hiring@confident-ai.com.

Feature Comparison

Endless options designed to scale with your data, support, and infrastructure needs.

Core
Starter
Try Now >
Enterprise
Contact Us >

DeepEval

Evaluation Analytics

Evaluation Debugging

Evaluation Data Exports

Manage Evaluation Datasets

Production Events Tracking

Real-time Evaluations

Custom Metrics in Production

Integration with CI/CD

-

Human A/B Testing Suite

-

JudgementalGPT

-
-

Evaluation Alerting

-
-

Custom Evaluation LLM

-
-

RESTified API endpoints

-
-

Performance Reports

-
-
Data
Starter
Premium
Enterprise

Users per project

Up to 1
Starting from 8
No limit

Data Storage

No limit
No limit
No limit

Evaluation Datasets

Starting from to 5
No limit
No limit

Data Retention

6 months
3 years
7 years
Support
Starter
Premium
Enterprise

Documentation

Community support

Email/ticketed support

Expert technical support

-

Live support

-

Priority response SLAs

-
-

Dedicated onboarding

-
-
Infrastructure
Starter
Premium
Enterprise

SSO Authentication

Auto-scaling

-

Failover for high availability

-
-

Up-time SLA

-
95.0%
99.95%

VPC Peering

-
-
1 included ($250/month per additional VPC)

On-premise deployment

-
-
Contact Us

Frequently Asked Questions.

Does Dataplus offers a free trial?

Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.

Do you offer CRM team plans?

Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.

What are the features in the roadmap?

Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.

Do you currently have open positions?

Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.

Do you offer discounts for non-profits?

Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.

Start using the data retrieval platform of the future.

A CRM Platform For Power Users - Dataplus X Webflow Template