confident-ai/deepeval

19.5

Adjusted Score

19.5

Raw Score

100%

Time Factor

2026-07-12

Last Push

16.8K

Stars

Python

Language

322.2K

Lines of Code

2.0K

Files

4.3K

Pattern Hits

2026-07-14

Scan Date

0.10

HC Hit Rate

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL 0HIGH 200MEDIUM 500LOW 3572

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.

Pattern Findings

The scanner identified 4272 distinct pattern matches across 19 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.

Hyper-Verbose Identifiers2150 hits · 2127 pts

Severity	File	Line	Snippet	Context
LOW	typescript/test/test-integrations/utils.ts	54	export function assertJsonObjectStructure(	CODE
LOW	typescript/src/metrics/conversational-utils.ts	10	export function checkConversationalTestCaseParams(	CODE
LOW	typescript/src/metrics/g-eval/utils.ts	61	export function constructGEvalParamsString(	CODE
LOW	typescript/src/metrics/g-eval/utils.ts	105	export function validateAndSortRubrics(rubrics?: Rubric[]): Rubric[] \| undefined {	CODE
LOW	typescript/src/metrics/g-eval/utils.ts	122	export function validateCriteriaAndEvaluationSteps(	CODE
LOW	typescript/src/metrics/conversational-g-eval/utils.ts	18	export function constructConversationalGEvalTurnParamsString(	CODE
LOW	typescript/src/metrics/conversational-g-eval/utils.ts	61	export function constructNonTurnsTestCaseString(	CODE
LOW	typescript/src/test-case/mllm-image.ts	186	export function extractImageIdsFromString(	CODE
LOW	typescript/src/dataset/utils.ts	5	export function convertTestCasesToGoldens(testCases: LLMTestCase[]): Golden[] {	CODE
LOW	typescript/src/dataset/utils.ts	25	export function convertGoldensToTestCases(	CODE
LOW	typescript/src/dataset/utils.ts	47	export function convertConvoGoldensToConvoTestCases(	CODE
LOW	typescript/src/dataset/utils.ts	90	function createConversationalTestCase(params: {	CODE
LOW	typescript/src/integrations/openinference/index.ts	130	export function createOpenInferenceProcessors(	CODE
LOW	typescript/src/integrations/langchain/utils.ts	114	export function prepareToolCallInputParameters(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	29	export function updateTracePropertiesFromSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	138	function updateSpanPropertiesFromResponseSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	212	function updateSpanPropertiesFromGenerationSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	250	function updateSpanPropertiesFromFunctionSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	271	function updateSpanPropertiesFromAgentSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	299	function updateSpanPropertiesFromMcpListToolsSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	313	function updateSpanPropertiesFromHandoffSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	327	function updateSpanPropertiesFromCustomSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	338	function updateSpanPropertiesFromGuardrailSpanData(	CODE
LOW	typescript/src/integrations/openai-agents/extractors.ts	456	function parseFunctionToolCallParam(toolCall: any): any {	CODE
LOW	typescript/src/evaluate/console-report.ts	361	export function printHyperparametersWarning(): void {	CODE
LOW	typescript/src/prompt/utils.ts	22	export function interpolateMustacheWithSpace(	CODE
LOW	typescript/src/prompt/utils.ts	44	export function interpolateDollarBrackets(	CODE
LOW	typescript/src/openai/utils.ts	28	export function stringifyMultimodalContent(content: any): string {	CODE
LOW	typescript/src/openai/extractor.ts	13	export function safeExtractInputParameters(	CODE
LOW	typescript/src/openai/extractor.ts	26	export function extractInputParametersFromCompletionAPI(	CODE
LOW	typescript/src/openai/extractor.ts	63	export function extractInputParametersFromResponseAPI(	CODE
LOW	typescript/src/openai/extractor.ts	99	export function safeExtractOutputParameters(	CODE
LOW	typescript/src/openai/extractor.ts	119	export function extractOutputParametersFromCompletionAPI(	CODE
LOW	typescript/src/openai/extractor.ts	165	export function extractOutputParametersFromResponseAPI(	CODE
LOW	typescript/src/openai/patch.ts	93	function patchAsyncOpenAIClientMethod(	CODE
LOW	typescript/src/openai/patch.ts	183	function updateInputAndOutputOfCurrentTrace(	CODE
LOW⚡	tests/test_agent_loop_detection.py	86	def test_repeated_tool_calls_detected():	CODE
LOW⚡	tests/test_agent_loop_detection.py	119	def test_no_trace_returns_zero():	CODE
LOW⚡	tests/test_agent_loop_detection.py	135	def test_reasoning_stagnation_detected():	CODE
LOW⚡	tests/test_agent_loop_detection.py	169	def test_disable_tool_repetition_check():	CODE
LOW⚡	tests/test_agent_loop_detection.py	198	def test_score_combines_with_correct_weights():	CODE
LOW⚡	tests/test_agent_loop_detection.py	216	def test_call_graph_cycle_detected():	CODE
LOW⚡	tests/test_agent_loop_detection.py	258	def test_sequential_same_name_not_a_cycle():	CODE
LOW⚡	tests/test_agent_loop_detection.py	290	def test_same_name_different_input_not_a_cycle():	CODE
LOW⚡	tests/test_agent_loop_detection.py	335	def test_reordered_stagnation_detected():	CODE
LOW	tests/test_templates/test_metric_templates.py	42	def test_templates_json_is_up_to_date():	CODE
LOW	tests/test_templates/test_metric_templates.py	52	def test_templates_json_is_valid_and_nonempty():	CODE
LOW	tests/test_metrics/test_image_coherence_metric.py	20	def test_multimodal_async_metric_measure(self):	CODE
LOW	tests/test_metrics/test_image_coherence_metric.py	41	def test_multimodal_sync_metric_measure(self):	CODE
LOW	tests/test_metrics/test_image_coherence_metric.py	62	def test_invalid_model_throws_error_for_multimodal(self):	CODE
LOW	tests/test_metrics/test_image_coherence_metric.py	82	def test_multimodal_evaluate_method(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	21	def test_normal_sync_metric_measure(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	41	def test_normal_async_metric_measure(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	61	def test_multimodal_async_metric_measure(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	78	def test_multimodal_sync_metric_measure(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	95	def test_invalid_model_throws_error_for_multimodal(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	109	def test_normal_evaluate_method(self):	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	129	def test_multimodal_evaluate_method(self):	CODE
LOW	tests/test_metrics/test_topic_adherence_metric.py	20	def test_normal_sync_metric_measure(self):	CODE
LOW	tests/test_metrics/test_topic_adherence_metric.py	44	def test_normal_async_metric_measure(self):	CODE
2090 more matches not shown…

Decorative Section Separators410 hits · 1545 pts

Severity	File	Line	Snippet	Context
MEDIUM⚡	tests/test_agent_loop_detection.py	81	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	83	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	114	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	116	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	130	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	132	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	164	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	166	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	193	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	195	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	211	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	213	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	253	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	255	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	285	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	287	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	330	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_agent_loop_detection.py	332	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_agent_loop_detection.py	12	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_agent_loop_detection.py	14	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_agent_loop_detection.py	58	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_agent_loop_detection.py	60	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	31	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	33	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	224	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	226	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	396	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/test_metrics/test_dag_serialization.py	398	# ----------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_metrics/test_dag_serialization.py	287	# ----------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_metrics/test_dag_serialization.py	289	# ----------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	172	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	174	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	281	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	283	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	465	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	468	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	528	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	530	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	1002	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	…t_integrations/test_googleadk/test_span_interceptor.py	1004	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	362	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	364	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	702	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	705	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	956	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…t_integrations/test_googleadk/test_span_interceptor.py	958	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_sync.py	42	# =============================================================================	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_sync.py	44	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_sync.py	66	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_sync.py	68	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_sync.py	105	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_sync.py	107	# =============================================================================	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_async.py	58	# =============================================================================	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_async.py	60	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_async.py	83	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_async.py	85	# =============================================================================	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_async.py	108	# =============================================================================	COMMENT
MEDIUM	tests/test_integrations/test_llamaindex/test_async.py	110	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_async.py	150	# =============================================================================	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_async.py	152	# =============================================================================	COMMENT
350 more matches not shown…

Cross-File Repetition135 hits · 675 pts

Severity	File	Line	Snippet	Context
HIGH	tests/test_integrations/test_googleadk/test_sync.py	0	tool-level metric_collection now flows through ``with next_tool_span(metric_collection=...)`` at the call site instead o	STRING
HIGH	tests/test_integrations/test_strands/test_sync.py	0	tool-level metric_collection now flows through ``with next_tool_span(metric_collection=...)`` at the call site instead o	STRING
HIGH	tests/test_integrations/test_agentcore/test_sync.py	0	tool-level metric_collection now flows through ``with next_tool_span(metric_collection=...)`` at the call site instead o	STRING
HIGH	tests/test_integrations/test_googleadk/test_async.py	0	async equivalent of ``testdeepevalfeatures``: span-level kwargs migrate from ``init_evals_agentcore(...)`` to per-call `	STRING
HIGH	tests/test_integrations/test_strands/test_async.py	0	async equivalent of ``testdeepevalfeatures``: span-level kwargs migrate from ``init_evals_agentcore(...)`` to per-call `	STRING
HIGH	tests/test_integrations/test_agentcore/test_async.py	0	async equivalent of ``testdeepevalfeatures``: span-level kwargs migrate from ``init_evals_agentcore(...)`` to per-call `	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	falls back to settings when current_trace_context is none.	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	falls back to settings when current_trace_context is none.	STRING
HIGH	…_integrations/test_pydanticai/test_span_interceptor.py	0	falls back to settings when current_trace_context is none.	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	falls back to settings when current_trace_context is none.	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	trace attrs are snapshotted fresh at on_end, not on_start. regression guard for the at-on_start asymmetry: if a downstre	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	trace attrs are snapshotted fresh at on_end, not on_start. regression guard for the at-on_start asymmetry: if a downstre	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	trace attrs are snapshotted fresh at on_end, not on_start. regression guard for the at-on_start asymmetry: if a downstre	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	symmetric to ``testspancontextpushpop`` but at the trace level. the interceptor pushes an implicit ``trace`` placeholder	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	symmetric to ``testspancontextpushpop`` but at the trace level. the interceptor pushes an implicit ``trace`` placeholder	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	symmetric to ``testspancontextpushpop`` but at the trace level. the interceptor pushes an implicit ``trace`` placeholder	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	when a real deepeval span is on ``current_span_context`` and the otel span is a root (no native parent), the interceptor	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	when a real deepeval span is on ``current_span_context`` and the otel span is a root (no native parent), the interceptor	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	when a real deepeval span is on ``current_span_context`` and the otel span is a root (no native parent), the interceptor	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	otel children already have a real parent_id pointing into the same otel trace — no need to bridge.	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	otel children already have a real parent_id pointing into the same otel trace — no need to bridge.	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	otel children already have a real parent_id pointing into the same otel trace — no need to bridge.	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	one-shot semantics through the interceptor: a second agent span inside the same ``with`` block does not inherit.	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	one-shot semantics through the interceptor: a second agent span inside the same ``with`` block does not inherit.	STRING
HIGH	…_integrations/test_pydanticai/test_span_interceptor.py	0	one-shot semantics through the interceptor: a second agent span inside the same ``with`` block does not inherit.	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	one-shot semantics through the interceptor: a second agent span inside the same ``with`` block does not inherit.	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	typed slot is not consumed by spans of a different type. an llm span fired inside ``with next_agent_span(...)`` should p	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	typed slot is not consumed by spans of a different type. an llm span fired inside ``with next_agent_span(...)`` should p	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	typed slot is not consumed by spans of a different type. an llm span fired inside ``with next_agent_span(...)`` should p	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	mirrors the ``test_tool_metric_collection`` flow in test_sync.py — ``with next_tool_span(metric_collection=...)`` sets t	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	mirrors the ``test_tool_metric_collection`` flow in test_sync.py — ``with next_tool_span(metric_collection=...)`` sets t	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	mirrors the ``test_tool_metric_collection`` flow in test_sync.py — ``with next_tool_span(metric_collection=...)`` sets t	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	last-write-wins: ``next_agent_span`` sets the floor at on_start; later ``update_current_span(...)`` (e.g. from inside a	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	last-write-wins: ``next_agent_span`` sets the floor at on_start; later ``update_current_span(...)`` (e.g. from inside a	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	last-write-wins: ``next_agent_span`` sets the floor at on_start; later ``update_current_span(...)`` (e.g. from inside a	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	``with next_agent_span(metrics=[...])`` populates the placeholder; at on_end the interceptor calls ``stash_pending_metri	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	``with next_agent_span(metrics=[...])`` populates the placeholder; at on_end the interceptor calls ``stash_pending_metri	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	``with next_agent_span(metrics=[...])`` populates the placeholder; at on_end the interceptor calls ``stash_pending_metri	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	in production paths (``is_evaluating=false``) the metrics overlay would leak — gate prevents the stash.	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	in production paths (``is_evaluating=false``) the metrics overlay would leak — gate prevents the stash.	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	in production paths (``is_evaluating=false``) the metrics overlay would leak — gate prevents the stash.	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	span-level kwargs were removed in the otel poc migration. each must raise ``typeerror`` on construction so callers see e	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	span-level kwargs were removed in the otel poc migration. each must raise ``typeerror`` on construction so callers see e	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	span-level kwargs were removed in the otel poc migration. each must raise ``typeerror`` on construction so callers see e	STRING
HIGH	…t_integrations/test_googleadk/test_span_interceptor.py	0	constructor must succeed when no api_key is supplied or in env. the otel pipeline still wires up locally — only the outb	STRING
HIGH	…est_integrations/test_strands/test_span_interceptor.py	0	constructor must succeed when no api_key is supplied or in env. the otel pipeline still wires up locally — only the outb	STRING
HIGH	…t_integrations/test_agentcore/test_span_interceptor.py	0	constructor must succeed when no api_key is supplied or in env. the otel pipeline still wires up locally — only the outb	STRING
HIGH	…ns/test_googleadk/apps/googleadk_multiple_tools_app.py	0	trace-level setup for the multiple-tools fixture. per-tool / per-agent metric collections belong on ``with next_*_span(.	STRING
HIGH	…ations/test_strands/apps/strands_multiple_tools_app.py	0	trace-level setup for the multiple-tools fixture. per-tool / per-agent metric collections belong on ``with next_*_span(.	STRING
HIGH	…ns/test_agentcore/apps/agentcore_multiple_tools_app.py	0	trace-level setup for the multiple-tools fixture. per-tool / per-agent metric collections belong on ``with next_*_span(.	STRING
HIGH	…integrations/test_googleadk/apps/googleadk_tool_app.py	0	trace-only setup. tool / agent / llm span-level fields belong at the call site (``with next_*_span(...)`` or ``update_cu	STRING
HIGH	…est_integrations/test_strands/apps/strands_tool_app.py	0	trace-only setup. tool / agent / llm span-level fields belong at the call site (``with next_*_span(...)`` or ``update_cu	STRING
HIGH	…integrations/test_agentcore/apps/agentcore_tool_app.py	0	trace-only setup. tool / agent / llm span-level fields belong at the call site (``with next_*_span(...)`` or ``update_cu	STRING
HIGH	…integrations/test_googleadk/apps/googleadk_eval_app.py	0	agentcore evals fixture — trace-level setup with a strands tool that mutates its own span via ``update_current_span``. a	STRING
HIGH	…est_integrations/test_strands/apps/strands_eval_app.py	0	agentcore evals fixture — trace-level setup with a strands tool that mutates its own span via ``update_current_span``. a	STRING
HIGH	…integrations/test_agentcore/apps/agentcore_eval_app.py	0	agentcore evals fixture — trace-level setup with a strands tool that mutates its own span via ``update_current_span``. a	STRING
HIGH	…test_integrations/test_crewai/test_crewai_component.py	0	fetch weather data for a given city. returns temperature and conditions.	STRING
HIGH	tests/test_integrations/test_crewai/test_crewai.py	0	fetch weather data for a given city. returns temperature and conditions.	STRING
HIGH	…val/test_ai_agent_evals/test_crewai/test_crewai_e2e.py	0	fetch weather data for a given city. returns temperature and conditions.	STRING
HIGH	tests/test_integrations/test_langgraph/test_sync.py	0	decorator that switches between generate and assert mode based on generate_schemas env var. args: schema_name: name of t	STRING
75 more matches not shown…

Unused Imports593 hits · 556 pts

Severity	File	Line	Context
LOW	test_agentcore_agent.py	33	CODE
LOW	test_agentcore_agent.py	35	CODE
LOW	test_pydantic_agent.py	19	CODE
LOW	tests/test_agent_loop_detection.py	6	CODE
LOW	tests/test_metrics/test_mcp_use_metric.py	4	CODE
LOW	tests/test_metrics/test_exact_match_metric.py	4	CODE
LOW	tests/test_metrics/test_dag_serialization.py	4	CODE
LOW	tests/test_metrics/test_dag_serialization.py	21	CODE
LOW	…sts/test_metrics/test_turn_contextual_recall_metric.py	4	CODE
LOW	tests/test_metrics/turn_contextual_relevancy_metric.py	4	CODE
LOW	…t_metrics/test_answer_relevancy_metric_empty_output.py	18	CODE
LOW	tests/test_integrations/utils.py	1	CODE
LOW	…t_integrations/test_googleadk/test_span_interceptor.py	46	CODE
LOW	…tegrations/test_googleadk/apps/googleadk_simple_app.py	8	CODE
LOW	…ns/test_googleadk/apps/googleadk_multiple_tools_app.py	9	CODE
LOW	…integrations/test_googleadk/apps/googleadk_tool_app.py	8	CODE
LOW	…integrations/test_googleadk/apps/googleadk_eval_app.py	18	CODE
LOW	tests/test_integrations/test_anthropic/conftest.py	6	CODE
LOW	tests/test_integrations/test_crewai/test_stress.py	48	CODE
LOW	…t_integrations/test_crewai/test_knowledge_retriever.py	7	CODE
LOW	…t_integrations/test_crewai/test_knowledge_retriever.py	8	CODE
LOW	…test_integrations/test_crewai/test_crewai_component.py	2	CODE
LOW	…test_integrations/test_crewai/test_crewai_component.py	3	CODE
LOW	…test_integrations/test_crewai/test_crewai_component.py	4	CODE
LOW	…test_integrations/test_crewai/test_crewai_component.py	5	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	2	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	3	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	4	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	5	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	7	CODE
LOW	tests/test_integrations/test_crewai/test_crewai.py	19	CODE
LOW	tests/test_integrations/test_crewai/apps/simple_app.py	7	CODE
LOW	tests/test_integrations/test_crewai/apps/simple_app.py	9	CODE
LOW	tests/test_integrations/test_crewai/apps/async_app.py	7	CODE
LOW	…rations/test_langgraph/apps/langgraph_next_span_app.py	18	CODE
LOW	tests/test_integrations/test_strands/conftest.py	2	CODE
LOW	…est_integrations/test_strands/test_span_interceptor.py	28	CODE
LOW	…est_integrations/test_strands/test_span_interceptor.py	40	CODE
LOW	…tegrations/test_openai/test_tool_call_flow_response.py	6	CODE
LOW	tests/test_integrations/test_openai/test_sync_openai.py	4	CODE
LOW	…grations/test_openai/test_tool_call_flow_completion.py	6	CODE
LOW	…sts/test_integrations/test_openai/test_async_openai.py	1	CODE
LOW	…sts/test_integrations/test_openai/test_async_openai.py	7	CODE
LOW	tests/test_integrations/test_openai_agents/test_sync.py	9	CODE
LOW	…grations/test_openai_agents/test_scenerios/test_run.py	6	CODE
LOW	…test_openai_agents/test_scenerios/test_multi_agents.py	2	CODE
LOW	…test_openai_agents/test_scenerios/test_multi_agents.py	3	CODE
LOW	…test_openai_agents/test_scenerios/test_multi_agents.py	6	CODE
LOW	…test_openai_agents/test_scenerios/test_multi_agents.py	7	CODE
LOW	…i_agents/test_scenerios/test_with_trace_and_wrapped.py	3	CODE
LOW	…s/test_openai_agents/test_scenerios/test_with_trace.py	5	CODE
LOW	…ons/test_openai_agents/test_scenerios/test_run_sync.py	2	CODE
LOW	…ons/test_openai_agents/test_scenerios/test_run_sync.py	3	CODE
LOW	…ai_agents/test_scenerios/test_weather_agent_patched.py	2	CODE
LOW	…test_openai_agents/test_scenerios/test_run_streamed.py	4	CODE
LOW	…sts/test_integrations/test_langchain/test_next_span.py	18	CODE
LOW	tests/test_integrations/test_pydanticai/conftest.py	2	CODE
LOW	tests/test_integrations/test_agentcore/conftest.py	2	CODE
LOW	…t_integrations/test_agentcore/test_span_interceptor.py	28	CODE
LOW	…t_integrations/test_agentcore/test_span_interceptor.py	40	CODE
533 more matches not shown…

Excessive Try-Catch Wrapping317 hits · 344 pts

Severity	File	Line	Snippet	Context
LOW⚡	tests/test_integrations/test_langgraph/conftest.py	143	except Exception:	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	241	except Exception:	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	267	except Exception:	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	691	except Exception:	CODE
LOW	…ns/test_langgraph/apps/langgraph_multiple_tools_app.py	69	except Exception as e:	CODE
LOW	…ns/test_langgraph/apps/langgraph_parallel_tools_app.py	80	except Exception:	CODE
LOW	tests/test_integrations/test_openai_agents/conftest.py	17	except Exception:	CODE
LOW	…est_integrations/test_openai_agents/apps/tool_agent.py	35	except Exception:	CODE
LOW⚡	tests/test_integrations/test_langchain/conftest.py	120	except Exception:	CODE
LOW	tests/test_integrations/test_langchain/conftest.py	199	except Exception:	CODE
LOW	tests/test_integrations/test_langchain/conftest.py	224	except Exception:	CODE
LOW	tests/test_integrations/test_langchain/conftest.py	455	except Exception:	CODE
LOW	…ntegrations/test_langchain/apps/langchain_agent_app.py	39	except Exception as e:	CODE
LOW	…test_langchain/apps/langchain_metric_collection_app.py	34	except Exception as e:	CODE
LOW	…ns/test_langchain/apps/langchain_parallel_tools_app.py	63	except Exception:	CODE
LOW	…ns/test_langchain/apps/langchain_multiple_tools_app.py	68	except Exception as e:	CODE
LOW	tests/test_core/conftest.py	7	except Exception:	CODE
LOW	tests/test_core/conftest.py	194	except Exception:	CODE
LOW	…racing/test_integration/test_current_golden_context.py	76	except Exception:	CODE
LOW	…racing/test_integration/test_current_golden_context.py	104	except Exception:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	39	except Exception as e:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	65	except Exception:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	84	except Exception:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	101	except Exception:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	134	except Exception as e:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	159	except Exception as e:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	222	except Exception as e:	CODE
LOW	tests/test_core/test_tracing/apps/sync_app.py	270	except Exception:	STRING
MEDIUM	…acing/test_generators/test_generator_context_safety.py	187	def consume(g):	CODE
LOW	…acing/test_generators/test_generator_context_safety.py	191	except Exception as e:	CODE
LOW	tests/test_core/test_evaluation/test_local_store.py	150	except Exception as e: # pragma: no cover	CODE
LOW	…re/test_evaluation/test_execute/test_error_boundary.py	442	except Exception:	CODE
LOW	…re/test_evaluation/test_execute/test_error_boundary.py	481	except Exception:	CODE
MEDIUM	…re/test_evaluation/test_execute/test_error_boundary.py	121	def parent_catches():	CODE
LOW	…s/test_core/test_synthesizer/test_context_generator.py	552	except Exception:	CODE
LOW	.scripts/changelog/release_notes.py	322	except Exception as exc:	STRING
LOW	.scripts/changelog/generate.py	553	except Exception as e:	STRING
LOW	.scripts/changelog/generate.py	971	except Exception as e:	STRING
LOW	.scripts/changelog/generate.py	1114	except Exception as e:	STRING
LOW	.scripts/changelog/generate.py	1143	except Exception:	STRING
LOW	.scripts/changelog/generate.py	1418	except Exception as e:	CODE
LOW	scripts/check_openai_model_capabilities.py	89	except Exception as exc:	CODE
LOW	scripts/check_openai_model_capabilities.py	119	except Exception as exc:	CODE
LOW⚡	deepeval/telemetry.py	140	except Exception:	CODE
LOW⚡	deepeval/telemetry.py	153	except Exception:	CODE
MEDIUM	deepeval/telemetry.py	41	def blocked_by_firewall():	CODE
LOW	deepeval/utils.py	484	except Exception as e:	CODE
MEDIUM	deepeval/utils.py	485	print(f"An error occurred: {e}")	CODE
LOW	deepeval/utils.py	831	except Exception:	CODE
LOW	deepeval/utils.py	857	except Exception:	CODE
MEDIUM	deepeval/utils.py	209	def get_or_create_event_loop() -> asyncio.AbstractEventLoop:	CODE
LOW	deepeval/metrics/indicator.py	139	except Exception as e:	CODE
LOW	deepeval/metrics/indicator.py	299	except Exception as e:	CODE
LOW	deepeval/metrics/utils.py	448	except Exception as e:	CODE
LOW	…l/metrics/agent_loop_detection/agent_loop_detection.py	307	except Exception:	CODE
LOW	deepeval/metrics/g_eval/utils.py	382	except Exception:	CODE
LOW	deepeval/metrics/role_adherence/role_adherence.py	205	except Exception:	CODE
LOW	deepeval/metrics/role_adherence/role_adherence.py	229	except Exception:	CODE
LOW	deepeval/model_integrations/utils.py	110	except Exception:	CODE
LOW	deepeval/tracing/tracing.py	126	except Exception as e:	CODE
257 more matches not shown…

Deep Nesting241 hits · 223 pts

Severity	File	Line	Context
LOW	tests/test_integrations/utils.py	28	CODE
LOW	tests/test_integrations/utils.py	87	CODE
LOW	tests/test_integrations/utils.py	43	CODE
LOW	tests/test_integrations/utils.py	156	CODE
LOW	tests/test_integrations/utils.py	270	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	182	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	213	CODE
LOW	tests/test_integrations/test_langgraph/conftest.py	347	CODE
LOW	…grations/test_openai/test_tool_call_flow_completion.py	66	CODE
LOW	tests/test_integrations/test_langchain/conftest.py	153	CODE
LOW	tests/test_integrations/test_langchain/conftest.py	176	CODE
LOW	…ntegrations/test_langchain/apps/langchain_agent_app.py	66	CODE
LOW	…ntegrations/test_langchain/apps/langchain_agent_app.py	115	CODE
LOW	…rations/test_langchain/apps/langchain_streaming_app.py	56	CODE
LOW	…rations/test_langchain/apps/langchain_streaming_app.py	101	CODE
LOW	…rations/test_langchain/apps/langchain_streaming_app.py	146	CODE
LOW	…rations/test_langchain/apps/langchain_streaming_app.py	191	CODE
LOW	…ns/test_langchain/apps/langchain_parallel_tools_app.py	86	CODE
LOW	…ns/test_langchain/apps/langchain_parallel_tools_app.py	131	CODE
LOW	…ns/test_langchain/apps/langchain_multiple_tools_app.py	86	CODE
LOW	…ns/test_langchain/apps/langchain_multiple_tools_app.py	131	CODE
LOW	…tions/test_langchain/apps/langchain_conditional_app.py	68	CODE
LOW	…tions/test_langchain/apps/langchain_conditional_app.py	113	CODE
LOW	…tions/test_langchain/apps/langchain_single_tool_app.py	35	CODE
LOW	…tions/test_langchain/apps/langchain_single_tool_app.py	84	CODE
LOW	tests/test_confident/test_dataset.py	72	CODE
LOW	tests/test_core/stubs.py	119	CODE
LOW	tests/test_core/stubs.py	125	CODE
LOW	…re/test_evaluation/test_execute/test_error_boundary.py	436	CODE
LOW	.scripts/changelog/release_notes.py	106	CODE
LOW	.scripts/changelog/generate.py	835	CODE
LOW	.scripts/changelog/generate.py	864	CODE
LOW	.scripts/changelog/generate.py	930	CODE
LOW	.scripts/changelog/generate.py	1039	CODE
LOW	.scripts/changelog/generate.py	1196	CODE
LOW	.scripts/changelog/generate.py	1206	CODE
LOW	.scripts/changelog/generate.py	1257	CODE
LOW	examples/mcp_evaluation/mcp_eval_multi_turn.py	59	CODE
LOW	scripts/compile_metric_templates.py	38	CODE
LOW	deepeval/progress_context.py	39	CODE
LOW	deepeval/utils.py	139	CODE
LOW	deepeval/utils.py	449	CODE
LOW	deepeval/utils.py	464	CODE
LOW	deepeval/metrics/indicator.py	73	CODE
LOW	deepeval/metrics/indicator.py	157	CODE
LOW	deepeval/metrics/indicator.py	241	CODE
LOW	deepeval/metrics/utils.py	233	CODE
LOW	deepeval/metrics/utils.py	305	CODE
LOW	deepeval/metrics/utils.py	659	CODE
LOW	deepeval/metrics/utils.py	758	CODE
LOW	deepeval/metrics/tool_use/tool_use.py	298	CODE
LOW	deepeval/metrics/summarization/summarization.py	292	CODE
LOW	deepeval/metrics/dag/graph.py	30	CODE
LOW	deepeval/metrics/dag/utils.py	33	CODE
LOW	deepeval/metrics/dag/utils.py	74	CODE
LOW	deepeval/metrics/dag/nodes.py	678	CODE
LOW	deepeval/metrics/dag/nodes.py	84	CODE
LOW	deepeval/metrics/dag/nodes.py	157	CODE
LOW	deepeval/metrics/conversational_dag/nodes.py	827	CODE
LOW	deepeval/metrics/conversational_dag/nodes.py	109	CODE
181 more matches not shown…

Cross-Language Confusion51 hits · 218 pts

Severity	File	Line	Snippet	Context
HIGH	tests/test_metrics/test_json_correctness_metric.py	30	actual_output="{'name': null}",	CODE
HIGH	tests/test_metrics/test_json_correctness_metric.py	56	actual_output="{'name': null}",	CODE
HIGH	tests/test_metrics/test_json_correctness_metric.py	150	actual_output="{'name': null}",	CODE
HIGH⚡	tests/test_confident/test_prompt.py	172	prompt.push(text=f"Latest content {UUID}")	CODE
HIGH⚡	tests/test_confident/test_prompt.py	185	prompt.push(text=f"Version 1 {UUID1}")	CODE
HIGH⚡	tests/test_confident/test_prompt.py	189	prompt.push(text=f"Version 2 {UUID2}")	CODE
HIGH⚡	tests/test_confident/test_prompt.py	898	prompt.push(messages=MESSAGES)	CODE
HIGH⚡	tests/test_confident/test_prompt.py	913	prompt.push(messages=MESSAGES1)	CODE
HIGH⚡	tests/test_confident/test_prompt.py	918	prompt.push(messages=MESSAGES2)	CODE
HIGH	tests/test_confident/test_prompt.py	136	prompt.push(text=TEXT)	CODE
HIGH	tests/test_confident/test_prompt.py	153	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	263	prompt_setup.push(text=f"Setup cache content {uuid.uuid4()}")	CODE
HIGH	tests/test_confident/test_prompt.py	314	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	341	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	370	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	409	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	428	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	447	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	466	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	485	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	507	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	537	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	587	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	620	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	632	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	659	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	694	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	736	prompt1.push(	CODE
HIGH	tests/test_confident/test_prompt.py	767	prompt.push(text="Main branch push")	CODE
HIGH	tests/test_confident/test_prompt.py	771	prompt.push(text="Different branch push", branch=self.BRANCH_NAME)	CODE
HIGH	tests/test_confident/test_prompt.py	855	prompt.push(messages=MESSAGES)	CODE
HIGH	tests/test_confident/test_prompt.py	875	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1054	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1084	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1118	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1168	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1221	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1257	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1269	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1299	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1337	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1380	prompt1.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1411	prompt.push(	CODE
HIGH	tests/test_confident/test_prompt.py	1417	prompt.push(	CODE
HIGH	tests/test_confident/test_dataset.py	125	dataset.push(alias=self.PUSH_ALIAS)	CODE
HIGH	tests/test_confident/test_dataset.py	165	dataset.push(alias=self.PUSH_ALIAS)	CODE
HIGH	tests/test_confident/test_dataset.py	183	new_dataset.push(alias=self.PUSH_ALIAS, version=version)	CODE
HIGH	tests/test_confident/test_dataset.py	214	dataset.push(alias=self.PUSH_ALIAS)	CODE
HIGH	deepeval/prompt/prompt.py	856	return self.push(	CODE
HIGH	deepeval/simulator/simulation_graph/template.py	40	"index": null,	CODE
HIGH	deepeval/test_run/hyperparameters.py	48	value.push(_verbose=verbose)	CODE

Self-Referential Comments43 hits · 135 pts

Severity	File	Line	Snippet	Context
MEDIUM	tests/test_integrations/test_exporter/readable_spans.py	5	# Create a simple span context	COMMENT
MEDIUM	tests/test_integrations/test_exporter/readable_spans.py	13	# Create the ReadableSpan with one attribute	COMMENT
MEDIUM	tests/test_integrations/test_exporter/readable_spans.py	184	# Create a multi-turn span context	COMMENT
MEDIUM	tests/test_integrations/test_exporter/readable_spans.py	192	# Create the multi-turn readable span	COMMENT
MEDIUM⚡	…t_integrations/test_crewai/test_knowledge_retriever.py	18	# Create a knowledge source	COMMENT
MEDIUM⚡	…t_integrations/test_crewai/test_knowledge_retriever.py	24	# Create an LLM with a temperature of 0 to ensure deterministic outputs	COMMENT
MEDIUM⚡	…t_integrations/test_crewai/test_knowledge_retriever.py	27	# Create an agent with the knowledge store	COMMENT
MEDIUM	tests/test_integrations/test_langgraph/conftest.py	78	# Create a unique identifier for this test run	COMMENT
MEDIUM	…test_langgraph/apps/langgraph_metric_collection_app.py	19	# Create a Prompt object for prompt tracking	COMMENT
MEDIUM	…grations/test_openai_agents/test_scenerios/test_run.py	63	# Create the weather specialist agent	COMMENT
MEDIUM	…i_agents/test_scenerios/test_with_trace_and_wrapped.py	82	# Create the weather specialist agent	COMMENT
MEDIUM	…s/test_openai_agents/test_scenerios/test_with_trace.py	65	# Create the weather specialist agent	COMMENT
MEDIUM	…ons/test_openai_agents/test_scenerios/test_run_sync.py	62	# Create the weather specialist agent	COMMENT
MEDIUM	…ai_agents/test_scenerios/test_weather_agent_patched.py	78	# Create the weather specialist agent	COMMENT
MEDIUM	…test_openai_agents/test_scenerios/test_run_streamed.py	63	# Create the weather specialist agent	COMMENT
MEDIUM	tests/test_integrations/test_langchain/conftest.py	58	# Create a unique identifier for this test run	COMMENT
MEDIUM	…test_langchain/apps/langchain_metric_collection_app.py	16	# Create a Prompt object for prompt tracking	COMMENT
MEDIUM	tests/test_core/stubs.py	189	# Define the mock types	COMMENT
MEDIUM	tests/test_core/test_models/test_openai_model.py	166	# Create a mock parsed response	COMMENT
MEDIUM	tests/test_core/test_models/test_openai_model.py	215	# Create a mock that tracks the call arguments	COMMENT
MEDIUM	tests/test_core/test_models/test_openai_model.py	263	# Create a mock parsed response	COMMENT
MEDIUM	tests/test_core/test_cli/test_cli.py	651	# Create a real JSON file (with whitespace) so we can verify normalization.	COMMENT
MEDIUM⚡	…agent_evals/test_langgraph/test_langgraph_component.py	12	# Define the tool	COMMENT
MEDIUM	…ocs/test_confident/test_integrations/test_langgraph.py	23	# Create a metric	COMMENT
MEDIUM	deepeval/metrics/ragas.py	76	# Create a dataset from the test case	COMMENT
MEDIUM	deepeval/metrics/ragas.py	531	# Create a dataset from the test case	COMMENT
MEDIUM	deepeval/tracing/tracing.py	589	# Create a new event loop	COMMENT
MEDIUM	deepeval/tracing/tracing.py	928	# Create the base API span	COMMENT
MEDIUM	deepeval/synthesizer/synthesizer.py	1875	# Create the pandas DataFrame	COMMENT
MEDIUM	deepeval/benchmarks/math_qa/math_qa.py	144	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/bbq/bbq.py	106	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/human_eval/human_eval.py	21	# Create a restricted globals dictionary with only safe built-ins	COMMENT
MEDIUM	deepeval/benchmarks/human_eval/human_eval.py	160	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/logi_qa/logi_qa.py	146	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/drop/drop.py	148	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/truthful_qa/truthful_qa.py	155	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/big_bench_hard/big_bench_hard.py	178	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/squad/squad.py	147	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/mmlu/mmlu.py	145	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/equity_med_qa/equity_med_qa.py	88	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/benchmarks/hellaswag/hellaswag.py	149	# Create a DataFrame from task_results_data	COMMENT
MEDIUM	deepeval/test_run/test_run.py	379	# Create a single TraceMetricScores object instead of a list	COMMENT
MEDIUM	deepeval/test_run/hyperparameters.py	72	# Define the wrapper function that will be the actual decorator	COMMENT

Modern Structural Boilerplate103 hits · 108 pts

Severity	File	Line	Snippet	Context
LOW⚡	tests/test_core/stubs.py	28	def update_metric_data(self, args, *kwargs) -> None: ...	CODE
LOW⚡	tests/test_core/stubs.py	29	def update_status(self, args, *kwargs) -> None: ...	CODE
LOW⚡	tests/test_core/stubs.py	30	def update_run_duration(self, args, *kwargs) -> None: ...	CODE
LOW⚡	tests/test_core/test_stubs_contract.py	144	def update_metric_data(self, args, *kwargs) -> None:	CODE
LOW⚡	tests/test_core/test_stubs_contract.py	147	def update_status(self, args, *kwargs) -> None:	CODE
LOW⚡	tests/test_core/test_stubs_contract.py	150	def update_run_duration(self, args, *kwargs) -> None:	CODE
LOW	deepeval/__init__.py	89	__all__ = [	CODE
LOW	deepeval/key_handler.py	14	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/metrics/indicator.py	22	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/metrics/__init__.py	71	__all__ = [	CODE
LOW	deepeval/metrics/conversational_g_eval/__init__.py	3	__all__ = ["ConversationalGEval"]	CODE
LOW	deepeval/metrics/hallucination/__init__.py	3	__all__ = ["HallucinationMetric"]	CODE
LOW	deepeval/metrics/tool_permission/__init__.py	3	__all__ = ["ToolPermissionMetric"]	CODE
LOW	deepeval/metrics/contextual_precision/__init__.py	3	__all__ = ["ContextualPrecisionMetric"]	CODE
LOW	deepeval/metrics/summarization/__init__.py	3	__all__ = ["SummarizationMetric"]	CODE
LOW	deepeval/metrics/misuse/__init__.py	3	__all__ = ["MisuseMetric"]	CODE
LOW	deepeval/metrics/dag/serialization/__init__.py	9	__all__ = [	CODE
LOW	deepeval/metrics/g_eval/__init__.py	4	__all__ = ["Rubric", "GEval"]	CODE
LOW	deepeval/metrics/toxicity/__init__.py	3	__all__ = ["ToxicityMetric"]	CODE
LOW	deepeval/metrics/faithfulness/__init__.py	3	__all__ = ["FaithfulnessMetric"]	CODE
LOW	deepeval/metrics/answer_relevancy/__init__.py	3	__all__ = ["AnswerRelevancyMetric"]	CODE
LOW	deepeval/metrics/contextual_recall/__init__.py	3	__all__ = ["ContextualRecallMetric"]	CODE
LOW	deepeval/metrics/bias/__init__.py	3	__all__ = ["BiasMetric"]	CODE
LOW	deepeval/metrics/community/__init__.py	5	__all__ = [	CODE
LOW	…al/metrics/community/citation_faithfulness/__init__.py	3	__all__ = ["CitationFaithfulnessMetric"]	CODE
LOW	deepeval/metrics/contextual_relevancy/__init__.py	3	__all__ = ["ContextualRelevancyMetric"]	CODE
LOW	deepeval/tracing/__init__.py	23	__all__ = [	CODE
LOW	deepeval/tracing/otel/context_aware_processor.py	40	logger = logging.getLogger(__name__)	CODE
LOW⚡	deepeval/tracing/otel/context_aware_processor.py	196	__all__ = ["ContextAwareSpanProcessor"]	CODE
LOW	deepeval/tracing/otel/__init__.py	4	__all__ = [	CODE
LOW	deepeval/optimizer/__init__.py	3	__all__ = [	CODE
LOW	deepeval/optimizer/rewriter/__init__.py	3	__all__ = [	CODE
LOW	deepeval/optimizer/algorithms/__init__.py	6	__all__ = ["GEPA", "MIPROV2", "COPRO", "SIMBA"]	CODE
LOW	deepeval/optimizer/algorithms/simba/__init__.py	3	__all__ = [	CODE
LOW	deepeval/optimizer/algorithms/simba/simba.py	80	def _update_step(self, message: str) -> None:	CODE
LOW	deepeval/optimizer/algorithms/simba/simba.py	84	def _update_trial_progress(self, step: int, total: int) -> None:	CODE
LOW	deepeval/optimizer/algorithms/gepa/gepa.py	773	def _update_step(self, iteration: int, label: str) -> None:	CODE
LOW	deepeval/optimizer/algorithms/gepa/__init__.py	3	__all__ = [	CODE
LOW	deepeval/optimizer/algorithms/miprov2/miprov2.py	140	def _update_step(self, message: str) -> None:	CODE
LOW	deepeval/optimizer/algorithms/miprov2/miprov2.py	145	def _update_trial_progress(self, step: int, total: int) -> None:	CODE
LOW	deepeval/optimizer/algorithms/copro/copro.py	80	def _update_step(self, message: str) -> None:	CODE
LOW	deepeval/optimizer/algorithms/copro/copro.py	84	def _update_trial_progress(self, step: int, total: int) -> None:	CODE
LOW	deepeval/optimizer/algorithms/copro/__init__.py	3	__all__ = [	CODE
LOW	deepeval/optimizer/scorer/__init__.py	3	__all__ = [	CODE
LOW	deepeval/config/utils.py	78	def set_env_bool(key: str, value: Optional[bool] = False) -> None:	CODE
LOW	deepeval/config/settings.py	54	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/config/settings_manager.py	21	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/test_case/__init__.py	25	__all__ = [	CODE
LOW	deepeval/dataset/__init__.py	5	__all__ = [	CODE
LOW	deepeval/openai_agents/patch.py	387	def _set_agent_metrics(agent: Agent, agent_span: AgentSpan) -> None:	CODE
LOW	deepeval/openai_agents/__init__.py	7	__all__ = ["DeepEvalTracingProcessor", "Agent", "function_tool"]	CODE
LOW	deepeval/annotation/__init__.py	3	__all__ = ["send_annotation", "a_send_annotation"]	CODE
LOW	deepeval/models/__init__.py	28	__all__ = [	CODE
LOW	deepeval/models/utils.py	7	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/models/retry_policy.py	65	logger = logging.getLogger(__name__)	CODE
LOW	deepeval/models/retry_policy.py	1032	__all__ = [	CODE
LOW	deepeval/models/embedding_models/__init__.py	6	__all__ = [	CODE
LOW	deepeval/models/llms/__init__.py	15	__all__ = [	CODE
LOW	deepeval/cli/auth/__init__.py	12	__all__ = [	CODE
LOW	deepeval/cli/diagnose/__init__.py	6	__all__ = ["diagnose_command", "resolve_setting_source"]	CODE
43 more matches not shown…

Modern AI Meta-Vocabulary27 hits · 90 pts

Severity	File	Line	Snippet	Context
MEDIUM	typescript/src/metrics/README.md	209	### Trace metrics (agentic)	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_sync.py	67	# RAG APP TESTS	COMMENT
MEDIUM⚡	tests/test_integrations/test_llamaindex/test_async.py	84	# ASYNC RAG APP TESTS	COMMENT
MEDIUM	…test_integrations/test_crewai/apps/hierarchical_app.py	23	agent=writer, # In hierarchical, this is the target, but Manager orchestrates	CODE
MEDIUM	tests/test_integrations/test_langgraph/conftest.py	601	context=context, # None - not a RAG app	CODE
MEDIUM	tests/test_integrations/test_langgraph/conftest.py	602	retrievalContext=retrieval_context, # None - not a RAG app	CODE
MEDIUM⚡	tests/test_integrations/test_langgraph/test_sync.py	554	# RETRIEVER (RAG) TESTS	COMMENT
MEDIUM	…rations/test_langgraph/apps/langgraph_retriever_app.py	111	# Create augmented prompt with system message for RAG	COMMENT
MEDIUM⚡	tests/test_integrations/test_langchain/test_sync.py	428	# RETRIEVER (RAG) TESTS	COMMENT
MEDIUM⚡	tests/test_integrations/test_langchain/test_async.py	436	# ASYNC RETRIEVER (RAG) TESTS	COMMENT
MEDIUM	…rations/test_langchain/apps/langchain_retriever_app.py	97	# Create augmented prompt with system message for RAG	COMMENT
MEDIUM	…rations/test_langchain/apps/langchain_retriever_app.py	142	# Create augmented prompt with system message for RAG	COMMENT
MEDIUM	…rations/test_langchain/apps/langchain_retriever_app.py	206	# Create augmented prompt with system message for RAG	COMMENT
MEDIUM	tests/test_core/test_imports.py	14	# RAG metrics	COMMENT
MEDIUM	tests/test_core/test_synthesizer_bugs.py	506	# The few-shot example must demonstrate the new key.	COMMENT
MEDIUM	tests/test_core/test_synthesizer_bugs.py	541	# The few-shot example must demonstrate the new key.	COMMENT
MEDIUM	…_core/test_evaluation/test_trace_results_extraction.py	68	# Monkeypatch create_api_trace in the agentic submodule where	COMMENT
MEDIUM	…ocs/test_deepeval/test_llm_evals/test_setup_tracing.py	71	# Calls RAG pipeline	COMMENT
MEDIUM	…ocs/test_deepeval/test_llm_evals/test_setup_tracing.py	77	# Generate final response incorporating both RAG and search results	COMMENT
MEDIUM	…est_confident/test_tracing_features/test_span_types.py	98	# Custom span wrapping the RAG pipeline	COMMENT
MEDIUM⚡	…est_confident/test_tracing_features/test_span_types.py	110	# Agent that does RAG + tool calling	COMMENT
MEDIUM⚡	…est_confident/test_tracing_features/test_span_types.py	113	# Call RAG pipeline	COMMENT
MEDIUM⚡	…est_confident/test_tracing_features/test_span_types.py	119	# Generate final response incorporating both RAG and search results	COMMENT
MEDIUM	docs/lib/source.ts	85	// Backwards-compatible alias so scaffold-generated routes that still import	COMMENT
MEDIUM	deepeval/metrics/__init__.py	86	# RAG metrics	COMMENT
MEDIUM	deepeval/evaluate/execute/__init__.py	28	# agentic	COMMENT
MEDIUM	deepeval/evaluate/execute/loop.py	443	# parent. Mirrors the async path in ``agentic.py``.	COMMENT

Over-Commented Block72 hits · 70 pts

Severity	File	Line	Snippet	Context
LOW	test_agentcore_agent.py	41	# Wire the deepeval OTel pipeline at import time. Trace-level kwargs	COMMENT
LOW	typescript/test/test-core/tracing.test.ts	81	return response;	COMMENT
LOW	typescript/test/test-core/tracing.test.ts	101	// toolsCalled: [toolCall],	COMMENT
LOW	typescript/test/test-core/tracing.test.ts	121	// query: "test query",	COMMENT
LOW	typescript/test/test-core/evaluate.test.ts	1	// import { config } from "dotenv";	COMMENT
LOW	typescript/test/test-core/evaluate.test.ts	21	// promptVersion: "v1.0",	COMMENT
LOW	typescript/examples/tracing/example-quickstart.ts	1	import OpenAI from "openai";	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	1	// /**	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	21	// // Option 2: Import from specific submodules	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	41	// filePath: csvFilePath,	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	61	// );	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	81	// actualOutput: "H2O",	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	101	// expectedOutput: "H2O",	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	121	// );	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	141	// // Create a new dataset for pushing	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	161	// retrievalContext: [	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	181	// if (error.message && error.message.includes("deepeval login")) {	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	201	// autoConvertGoldensToTestCases: false,	COMMENT
LOW	typescript/examples/dataset/example-dataset.ts	221	// }	COMMENT
LOW	typescript/src/integrations/langchain/patch-tool.ts	1	// import { tool as originalTool } from "@langchain/core/tools";	COMMENT
LOW	typescript/src/integrations/langchain/patch-tool.ts	21	// metrics = kwargs.metrics;	COMMENT
LOW	typescript/src/integrations/langchain/patch-tool.ts	41	// }	COMMENT
LOW	typescript/src/integrations/langchain/patch-tool.ts	61	// try {	COMMENT
LOW	…_integrations/test_pydanticai/test_span_interceptor.py	801	DeepEvalInstrumentationSettings(api_key="dummy", is_test_mode=False)	COMMENT
LOW	tests/test_confident/test_dataset.py	221		COMMENT
LOW	…i_agent_evals/test_pydanticai/test_pydanticai_setup.py	21	# result = await agent.run("What are the LLMs?")	COMMENT
LOW	…_deepeval/test_llm_evals/test_component_level_evals.py	61	# dataset.evaluate(task)	COMMENT
LOW	docs/source.config.ts	101	// `next/image`. Our blog + tutorial content references dozens of	COMMENT
LOW	docs/app/layout.tsx	61	// Site-wide fallback preview image. Every section/page inherits	COMMENT
LOW	docs/lib/layout.shared.tsx	81	// `links` intentionally omitted — text items live in `navLinks`	COMMENT
LOW	…ts/MetricTagsDisplayer/MetricTagsDisplayer.module.scss	1	// Metric classification tags shown under each metric's H1. Each variant	COMMENT
LOW	examples/tracing/test_chatbot.py	1	# from deepeval.tracing import trace, TraceType	COMMENT
LOW	examples/tracing/test_chatbot.py	21	# ],	COMMENT
LOW	examples/tracing/test_chatbot.py	41	# def retriever(self, input=input):	COMMENT
LOW	examples/tracing/test_chatbot.py	61		COMMENT
LOW	examples/tracing/test_chatbot.py	81	# "Be at least 35 years old.",	COMMENT
LOW	deepeval/metrics/ragas.py	261	def __name__(self):	COMMENT
LOW	deepeval/metrics/ragas.py	281	# if isinstance(model, str):	COMMENT
LOW	deepeval/metrics/ragas.py	301	# except ModuleNotFoundError:	COMMENT
LOW	deepeval/metrics/ragas.py	321	# metrics=[NoiseSensitivity()],	COMMENT
LOW	deepeval/tracing/types.py	201	turn_id: Optional[str] = Field(None, serialization_alias="turnId")	COMMENT
LOW	deepeval/tracing/context.py	281	# user-code seam — i.e. spans the user never executes code inside, so	COMMENT
LOW	deepeval/tracing/context.py	601	try:	COMMENT
LOW	deepeval/tracing/otel/exporter.py	121	spans: typing.Sequence[ReadableSpan],	COMMENT
LOW	deepeval/tracing/otel/exporter.py	181	current_trace.confident_api_key = api_key	COMMENT
LOW	deepeval/config/settings.py	841	description="Read timeout (seconds) when fetching remote images for multimodal inputs.",	COMMENT
LOW	deepeval/config/settings.py	881	5.0,	COMMENT
LOW	deepeval/openai_agents/runner.py	1	# from __future__ import annotations	COMMENT
LOW	deepeval/openai_agents/runner.py	21	# from agents.models.interface import Model	COMMENT
LOW	deepeval/openai_agents/runner.py	41	# # Import observed provider/model helpers from our agent module	COMMENT
LOW	deepeval/openai_agents/runner.py	61		COMMENT
LOW	deepeval/openai_agents/runner.py	81	# patched_get_model.__doc__ = original_get_model.__doc__	COMMENT
LOW	deepeval/openai_agents/runner.py	101	# context: Optional[TContext] = None,	COMMENT
LOW	deepeval/openai_agents/runner.py	121	# span_type="custom",	COMMENT
LOW	deepeval/openai_agents/runner.py	141	# current_span.input = input	COMMENT
LOW	deepeval/openai_agents/runner.py	161	# update_trace_attributes(output=_output)	COMMENT
LOW	deepeval/openai_agents/runner.py	181	# metadata: Optional[dict] = None,	COMMENT
LOW	deepeval/openai_agents/runner.py	201	# metric_collection=metric_collection,	COMMENT
LOW	deepeval/openai_agents/runner.py	221	# **kwargs, # backwards compatibility	COMMENT
LOW	deepeval/openai_agents/runner.py	241	# max_turns: int = DEFAULT_MAX_TURNS,	COMMENT
12 more matches not shown…

AI Structural Patterns74 hits · 69 pts

Severity	File	Line	Context
LOW	tests/test_integrations/test_langgraph/apps/main.py	167	CODE
LOW	…ts/test_core/test_tracing/test_masking/test_masking.py	32	CODE
LOW	…re/test_tracing/test_nested_spans/test_nested_spans.py	22	CODE
LOW	…re/test_tracing/test_nested_spans/test_nested_spans.py	77	CODE
LOW	tests/test_core/test_tracing/apps/async_app.py	56	CODE
LOW	…_agent_evals/test_langchain/test_langchain_e2e_sync.py	56	CODE
LOW	…ocs/test_confident/test_integrations/test_anthropic.py	22	CODE
LOW	…ocs/test_confident/test_integrations/test_anthropic.py	42	CODE
LOW	…t_docs/test_confident/test_integrations/test_openai.py	18	CODE
LOW	…t_docs/test_confident/test_integrations/test_openai.py	39	CODE
LOW	…est_confident/test_tracing_features/test_span_types.py	107	CODE
LOW	.scripts/changelog/generate.py	1039	CODE
LOW	…metrics/conversational_g_eval/conversational_g_eval.py	44	CODE
LOW	…l/metrics/agent_loop_detection/agent_loop_detection.py	119	CODE
LOW	deepeval/metrics/summarization/summarization.py	43	CODE
LOW	deepeval/metrics/g_eval/g_eval.py	46	CODE
LOW	deepeval/metrics/faithfulness/faithfulness.py	58	CODE
LOW	deepeval/metrics/turn_faithfulness/turn_faithfulness.py	48	CODE
LOW	deepeval/metrics/tool_correctness/tool_correctness.py	32	CODE
LOW	deepeval/metrics/turn_relevancy/turn_relevancy.py	30	CODE
LOW	deepeval/tracing/trace_context.py	46	CODE
LOW	deepeval/tracing/context.py	64	CODE
LOW	deepeval/tracing/context.py	120	CODE
LOW	deepeval/tracing/context.py	357	CODE
LOW	deepeval/tracing/context.py	405	CODE
LOW	deepeval/tracing/context.py	454	CODE
LOW	deepeval/tracing/context.py	513	CODE
LOW	deepeval/tracing/context.py	560	CODE
LOW	deepeval/optimizer/algorithms/gepa/gepa.py	88	CODE
LOW	deepeval/optimizer/algorithms/miprov2/miprov2.py	53	CODE
LOW	deepeval/dataset/dataset.py	255	CODE
LOW	deepeval/dataset/dataset.py	487	CODE
LOW	deepeval/dataset/dataset.py	672	CODE
LOW	deepeval/dataset/dataset.py	755	CODE
LOW	deepeval/dataset/dataset.py	1507	CODE
LOW	deepeval/annotation/annotation.py	7	CODE
LOW	deepeval/annotation/annotation.py	43	CODE
LOW	deepeval/models/_summac_model.py	293	CODE
LOW	deepeval/models/_summac_model.py	526	CODE
LOW	deepeval/models/llms/gemini_model.py	60	CODE
LOW	deepeval/models/llms/azure_model.py	41	CODE
LOW	deepeval/models/llms/amazon_bedrock_model.py	40	CODE
LOW	deepeval/cli/main.py	416	CODE
LOW	deepeval/cli/main.py	821	CODE
LOW	deepeval/cli/main.py	2794	CODE
LOW	deepeval/cli/test/command.py	48	CODE
LOW	deepeval/cli/generate/command.py	41	CODE
LOW	deepeval/synthesizer/synthesizer.py	118	CODE
LOW	deepeval/synthesizer/synthesizer.py	672	CODE
LOW	deepeval/synthesizer/synthesizer.py	905	CODE
LOW	deepeval/synthesizer/synthesizer.py	2301	CODE
LOW	deepeval/synthesizer/synthesizer.py	2548	CODE
LOW	deepeval/synthesizer/chunking/context_generator.py	111	CODE
LOW	deepeval/integrations/crewai/tool.py	26	CODE
LOW	deepeval/integrations/crewai/tool.py	50	CODE
LOW	deepeval/integrations/crewai/tool.py	75	CODE
LOW	deepeval/integrations/openinference/instrumentator.py	379	CODE
LOW	deepeval/integrations/openinference/otel.py	66	CODE
LOW	deepeval/integrations/langchain/callback.py	85	CODE
LOW	deepeval/integrations/langchain/patch.py	40	CODE
14 more matches not shown…

Docstring Block Structure11 hits · 55 pts

Severity	File	Line	Snippet	Context
HIGH	deepeval/utils.py	915	Ensures that a required parameter is provided. If the parameter is `None`, raises a `DeepEvalError` with a help	STRING
HIGH	deepeval/utils.py	947	Imports an optional dependency module or raises a `DeepEvalError` if the module is not found. The error message	STRING
HIGH	deepeval/dataset/dataset.py	271	Load test cases from a CSV file. This method reads a CSV file, extracting test case data based on spec	STRING
HIGH	deepeval/dataset/dataset.py	420	Load test cases from a JSON file. This method reads a JSON file containing a list of objects, each rep	STRING
HIGH	deepeval/models/utils.py	44	Extract base model name from provider-prefixed format. This function is useful for extracting the actual model name	STRING
HIGH	deepeval/models/utils.py	80	Normalize and validate a provider API key stored as a SecretStr. Args: secret: The SecretS	STRING
HIGH	deepeval/models/retry_policy.py	515	Run a synchronous callable with a soft timeout enforced by a helper thread, with a global cap on concurrent tim	STRING
HIGH	deepeval/scorer/scorer.py	276	Calculate the toxicity score of a given text prediction using the Detoxify model. Args: pr	STRING
HIGH	deepeval/synthesizer/synthesizer.py	1892	Save synthetic goldens to a file. Args: file_type: Type of file to save as ('json' or 'csv').	STRING
HIGH	deepeval/anthropic/utils.py	10	Return a short, human-readable summary string for an Anthropic-style multimodal `content` value. This is used	STRING
HIGH	deepeval/openai/utils.py	42	Return a short, human-readable summary string for an OpenAI-style multimodal `content` value. This is used to	STRING

Redundant / Tautological Comments24 hits · 38 pts

Severity	File	Line	Snippet	Context
LOW	tests/test_integrations/test_langgraph/conftest.py	630	# Print values when present	COMMENT
LOW	…ntegrations/test_langchain/apps/langchain_agent_app.py	82	# Check if we have tool calls	COMMENT
LOW	…ntegrations/test_langchain/apps/langchain_agent_app.py	131	# Check if we have tool calls	COMMENT
LOW	tests/test_core/test_evaluation/test_console_report.py	113	# Check if the aggregate table is built correctly	COMMENT
LOW	tests/test_core/test_evaluation/test_console_report.py	119	# Check if it's a Panel and contains the aggregate metrics table	COMMENT
LOW	…al/test_ai_agent_evals/test_setup_end_to_end_python.py	24	# Loop through dataset	COMMENT
LOW	deepeval/utils.py	418	# Check if obj is iterable (but not a string)	COMMENT
LOW	deepeval/metrics/dag/nodes.py	413	# Check if all children are ClassificationResultNode and their classifications are boolean	COMMENT
LOW	deepeval/metrics/dag/nodes.py	423	# Check if there is one True and one False classification	COMMENT
LOW	deepeval/metrics/dag/nodes.py	542	# Check if children is not empty	COMMENT
LOW	deepeval/metrics/dag/nodes.py	553	# Check if the verdict attribute of each child is a string	COMMENT
LOW	deepeval/metrics/conversational_dag/nodes.py	490	# Check if all children are ClassificationResultNode and their classifications are boolean	COMMENT
LOW	deepeval/metrics/conversational_dag/nodes.py	502	# Check if there is one True and one False classification	COMMENT
LOW	deepeval/metrics/conversational_dag/nodes.py	657	# Check if children is not empty	COMMENT
LOW	deepeval/metrics/conversational_dag/nodes.py	670	# Check if the verdict attribute of each child is a string	COMMENT
LOW⚡	deepeval/metrics/g_eval/utils.py	193	# Check if both criteria and evaluation_steps are not None at the same time	COMMENT
LOW⚡	deepeval/metrics/g_eval/utils.py	199	# Check if criteria is provided, it cannot be an empty string	COMMENT
LOW⚡	deepeval/metrics/g_eval/utils.py	203	# Check if evaluation_steps is provided, it cannot be an empty list	COMMENT
LOW	deepeval/metrics/g_eval/utils.py	224	# Check if ranges overlap	COMMENT
LOW	deepeval/tracing/tracing.py	575	# Add the trace to the queue	COMMENT
LOW	deepeval/tracing/tracing.py	822	# Add children to the stack for processing	COMMENT
LOW	deepeval/synthesizer/chunking/context_generator.py	341	# Check if chunk_size and max_context_size is valid for document lengths	COMMENT
LOW	deepeval/evaluate/execute/agentic.py	303	# Set it to None to ensure the test_case is added	COMMENT
LOW	deepeval/prompt/prompt.py	520	# Check if we need to bootstrap the cache	COMMENT

Magic Placeholder Names3 hits · 18 pts

Severity	File	Line	Snippet	Context
HIGH	typescript/examples/dataset/example-dataset.ts	188	// console.log("export CONFIDENT_API_KEY='your-api-key-here'");	COMMENT
HIGH	typescript/examples/dataset/example-dataset.ts	233	// console.log("export CONFIDENT_API_KEY='your-api-key-here'");	COMMENT
HIGH	deepeval/models/llms/gemini_model.py	52	api_key="your-api-key"	STRING

AI Slop Vocabulary5 hits · 13 pts

Severity	File	Line	Snippet	Context
MEDIUM	tests/test_core/test_tracing/apps/sync_app.py	212	research_prompt = f"""Based on the following retrieved documents, provide a comprehensive research analysis of '	STRING
MEDIUM	docs/src/sections/home/HomeSection.module.scss	96	// Duplicated content length is 200%; shift by half for a seamless loop.	COMMENT
MEDIUM⚡	deepeval/tracing/otel/context_aware_processor.py	139	# Trace-shape testing override: when a test harness has set	COMMENT
LOW⚡	deepeval/dataset/utils.py	279	# already a Task so just return it	COMMENT
LOW⚡	deepeval/dataset/utils.py	283	# If it is a future, it is already scheduled, so just return it	COMMENT

Fake / Example Data11 hits · 12 pts

Severity	File	Line	Snippet	Context
LOW	…acing/schemas/masking/comprehensive_masked_schema.json	15	"name": "John Doe"	CODE
LOW	…acing/schemas/masking/comprehensive_masked_schema.json	23	"name": "John Doe"	CODE
LOW	…acing/schemas/masking/comprehensive_masked_schema.json	39	"name": "John Doe"	CODE
LOW	…acing/schemas/masking/comprehensive_masked_schema.json	47	"name": "John Doe"	CODE
LOW⚡	…ts/test_core/test_tracing/test_masking/test_masking.py	80	"email": "user@example.com",	CODE
LOW⚡	…ts/test_core/test_tracing/test_masking/test_masking.py	82	"name": "John Doe",	CODE
LOW	…test_tracing/test_configuration/test_masking_config.py	183	return "user@example.com"	CODE
LOW	…test_tracing/test_configuration/test_masking_config.py	186	assert result == "user@example.com" # Function returns original	CODE
LOW⚡	tests/test_core/test_models/test_azure_model.py	603	api_key="fake-key",	CODE
LOW	…ics/knowledge_retention/templates/generate_verdict.txt	43	"Phone Number": "555-1029"	CODE
LOW	deepeval/cli/auth/flow.py	36	"email": "user@example.com"}}	STRING

Overly Generic Function Names1 hit · 0 pts

Severity	File	Line	Snippet	Context
LOW	tests/test_integrations/utils.py	441	async def my_function():	STRING

Slop Phrases1 hit · 0 pts

Severity	File	Line	Snippet	Context
LOW⚡	tests/test_core/test_prompts/test_interpolation.py	568	This ensures users don't forget to provide required template variables.	STRING

Analysis Overview

What These Metrics Mean

Score History

Severity Breakdown

Directory Score Breakdown

Pattern Findings