Start using the data retrieval platform of the future.

Join us to build and grow the world's biggest and most loved open-source LLM evaluation product.
Confident AI an open-source company building 1) an open-source package called DeepEval to unit-test LLM applications such as chatbots, agents, and RAG pipelines, and 2) the cloud platform for DeepEval. It's like Next.JS and Vercel. The founding team is a small group of exceptional engineers and researchers from top colleges and companies such as Google, Microsoft, and Princeton.
Things we value:
If this sounds like you, we'd love to talk to you. Please find available job openings down below.
What you'll be doing:
- Working on DeepEval (most used package for LLM evaluation in the world) for both LLM evaluation features and also LLM red teaming features.
- Incorporating the latest research in the features and metrics to our offering and constantly updating it as needed.
- Write content around what you've built in the form of documentation and blog articles for the open-source community.
- Support our open-source community for any questions and help they might need.
You should be able to:
- Read papers, and have a natural curiosity for new research.
- Write clearly, and is an avid reader.
- Code proficiently and quickly in Python and Typescript.
- Work 6 days a week, we're not hiding we expect a lot from you.
Your work will:
- Be used by hundreds of thousands of open-source users, all the way from individual hobbyist to AI leaders at Fortune 500 companies.
- Educate hundreds of thousands of people, that wouldn't otherwise know how to quality assure their LLM applications.
- Be respected and appreciated by the community.
By joining us, you will:
- Be shaping the future of LLM testing and evaluation.
- Learn how to run and do startups, in a relatively safe environment.
- Work closely with the founders, with the possibly of promoted to an executive role in the future.
- Be compensated highly, with generous founding equity. This also means that we expect a lot from you.
What you'll be doing:
- Working on Confident AI, the DeepEval cloud platform.
- Scale Confident AI's backend infrastructure to process millions of evaluations a month.
- Deploying Confident AI on-premises for enterprises.
- Support our closed-source customers and help them with anything they might need.
- Occasionally, write interesting content around how you're scaling Confident AI's systems for the developer community.
You should be able to:
- Write SQL, and be an expert in scaling relational database systems (PostgresQL).
- Dockerize distributed, and have experience working with the AWS services such as EKS.
- Conduct on-premise deployments in our customers' cloud providers such as AWS, Azure, and GCP.
- Work with multi-tenant (authentication) systems.
- Follow best data practices to ensure we remain SOCII and HIPAA complied.
- Code proficiently and quickly in Python and Typescript.
- Work 6 days a week, we're not hiding we expect a lot from you.
Your work will:
- Be used by hundreds of engineering teams, all the way from individual developers to Fortune 500 companies.
- Enable hundreds of engineering teams to gain instantly visibility into the performance of their LLM applications that wouldn't otherwise be possible.
- Make DeepEval even more popular (counter-intuitively).
- Be respected and appreciated by our customers.
By joining us, you will:
- Bring LLM testing and evaluation to the largest companies available.
- Learn how to serve enterprise customers as a startup, in a relatively safe environment.
- Work closely with the founders, with the possibly of promoted to an executive role in the future.
- Be compensated highly, with generous founding equity. This also means that we expect a lot from you.
The entire process is usually fully remote and all communication happens over email or via video chat in Google Meet. We know that you may be interviewing elsewhere as well so am respectful of your time and will get back no later than 2 days of each step along the process.
The entire process has 4 steps and takes around 1.5 week in total:
You'll be working with the founders directly throughout the entire process. For any questions, email hiring@confident-ai.com.
DeepEval
Evaluation Analytics
Evaluation Debugging
Evaluation Data Exports
Manage Evaluation Datasets
Production Events Tracking
Real-time Evaluations
Custom Metrics in Production
Integration with CI/CD
Human A/B Testing Suite
JudgementalGPT
Evaluation Alerting
Custom Evaluation LLM
RESTified API endpoints
Performance Reports
Users per project
Data Storage
Evaluation Datasets
Data Retention
Documentation
Community support
Email/ticketed support
Expert technical support
Live support
Priority response SLAs
Dedicated onboarding
Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.
Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.
Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.
Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.
Lorem ipsum dolor sit amet, consectetur adipiscing elit id venenatis pretium risus euismod dictum egestas orci netus feugiat ut egestas ut sagittis tincidunt phasellus elit etiam cursus orci in. Id sed montes.