The native DeepEval platform

Made by the creators of DeepEval, Confident AI is designed to scale your DeepEval AI testing workflows organization-wide with observability and collaboration.

TRUSTED BY 500+ LEADING AI COMPANIES
Panasonic · Toshiba · Samsung · Phreesia · BCG · Epic Games · Humach · Finom · Amdocs · ByteDance
COLLABORATION

Bye bye CSVs. Hello collaboration.

Shared evaluation dashboards

Every DeepEval test run is automatically synced to a shared dashboard. No more exporting CSVs or pasting results in Slack — your whole team sees the same metrics in real time.

Comment & annotate results

Leave comments on individual test cases and evaluation runs. Tag teammates, flag regressions, and resolve issues without switching tools.

Version datasets

Every dataset change is tracked with full version history. Roll back bad edits, compare test case coverage across versions, and know exactly what changed between evaluation runs.

Align metrics with humans

Compare metric scores against human annotations to surface false positives and negatives. Know exactly where your evals agree with your team — and where they don't.

Regression testing

Catch quality drops before they ship. Automatically compare new runs against your last known-good baseline and surface the exact test cases that regressed.

ENTERPRISE

Built for teams that can't afford to get it wrong.

HIPAA, SOC 2 COMPLIANT
Our compliance standards meet the requirements of even the most regulated healthcare, insurance, and financial industries.
MULTI-REGION DATA RESIDENCY
Store and process data in the United States of America (North Carolina) or the European Union (Frankfurt).
RBAC AND DATA MASKING
Our flexible infrastructure supports data separation between projects, custom permission controls, and data masking for LLM traces.
99.9% UPTIME SLA
We offer enterprise-level guarantees for our services to ensure mission-critical workflows are always accessible.
ON-PREM HOSTING
Optionally deploy Confident AI in your own cloud, whether AWS, Azure, or GCP, with tailored hands-on support.
INTEGRATION

Stay In Your Stack.
We'll Meet You There.

SDKs in Python and TypeScript; 20+ integrations, including OpenAI, LangGraph, OpenTelemetry, and many more LLM gateways.

pip install deepeval
Integrations include: OpenAI Agents, LlamaIndex, LangGraph, Pydantic AI, Crew AI, OpenTelemetry, OpenAI, LangChain, Vercel AI SDK, Agent Core, LiteLLM, and Portkey.
COMMUNITY

The Future of Quality AI Depends on You.

Join the largest and fastest-growing community for AI evaluation.

FAQ

Have a Question?

Check out our FAQs below, or talk to a human. They won't hallucinate.

DeepEval is an open-source evaluation framework that lets you write and run LLM evaluation tests locally in Python. Confident AI is the cloud platform built on top of DeepEval that adds centralized test management, observability, collaboration, and analytics so teams can scale their evaluation workflows organization-wide.
Yes. The team behind Confident AI created and maintains DeepEval. DeepEval was open-sourced to give the community a best-in-class LLM evaluation framework, while Confident AI extends it with the enterprise features teams need to operationalize evaluations at scale.
No. Confident AI is a standalone platform whose APIs are integrated into DeepEval. However, Confident AI is also a full LLM observability platform, so you can use it to trace, monitor, and evaluate your LLM applications in one place — no more siloing evals and tracing across different tools.
Almost certainly. Observability is trivial; evals are the real challenge. Confident AI's observability is one of the best solutions for quality-driven monitoring of AI apps, and one of the cheapest on the market. See pricing.
Yes. DeepEval is fully open-source under the Apache 2.0 license and free to use for any purpose. Confident AI offers a free tier as well, along with paid plans for teams that need advanced features like role-based access, custom dashboards, and dedicated support.
DeepEval ships with 50+ research-backed metrics including faithfulness, answer relevancy, contextual recall, contextual precision, hallucination, bias, toxicity, and more. You can also define fully custom metrics using Python or LLM-as-a-judge approaches.
Yes. While Confident AI has first-class support for DeepEval, it also integrates with other popular tools and frameworks through its REST API and SDKs, so you can centralize results regardless of how you run your evaluations.
Install DeepEval with pip install deepeval, write your first evaluation test, and optionally connect to Confident AI by running deepeval login. You can also sign up for Confident AI directly and start using the platform without DeepEval.
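To make the idea of an evaluation test concrete, here is a minimal, dependency-free sketch of what such a test does under the hood: score a test case with a metric, then split cases by a pass threshold. The `TestCase`, `exact_match_metric`, and `run_eval` names below are illustrative stand-ins, not the DeepEval API; in DeepEval you would use `LLMTestCase` and its research-backed metrics instead.

```python
from dataclasses import dataclass

# Illustrative stand-in for DeepEval's LLMTestCase (not the real API).
@dataclass
class TestCase:
    input: str
    actual_output: str
    expected_output: str

def exact_match_metric(case: TestCase) -> float:
    """Toy metric: 1.0 if the output matches the expectation exactly, else 0.0."""
    return 1.0 if case.actual_output.strip() == case.expected_output.strip() else 0.0

def run_eval(cases, metric, threshold=0.5):
    """Miniature test run: bucket each case as passed or failed by metric score."""
    passed, failed = [], []
    for case in cases:
        (passed if metric(case) >= threshold else failed).append(case)
    return passed, failed

cases = [
    TestCase("What is 2+2?", "4", "4"),
    TestCase("Capital of France?", "Lyon", "Paris"),
]
passed, failed = run_eval(cases, exact_match_metric)
```

Real evaluation frameworks replace the exact-match toy with semantic and LLM-as-a-judge metrics, but the shape of the loop is the same: test cases in, per-case scores and pass/fail verdicts out.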