deepseek-ai/DeepEP

5.6

Adjusted Score

5.6

Raw Score

100%

Time Factor

2026-07-14

Last Push

9.8K

Stars

Cuda

Language

13.0K

Lines of Code

Files

Pattern Hits

2026-07-14

Scan Date

0.00

HC Hit Rate

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL 0HIGH 0MEDIUM 2LOW 65

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.

Pattern Findings

The scanner identified 67 distinct pattern matches across 10 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.

Hyper-Verbose Identifiers19 hits · 20 pts

Severity	File	Line	Snippet	Context
LOW	setup.py	42	def get_nvshmem_host_lib_name(base_dir):	CODE
LOW	deep_ep/buffers/legacy.py	176	def get_low_latency_rdma_size_hint(num_max_dispatch_tokens_per_rank: int, hidden: int, num_ranks: int, num_experts:	CODE
LOW	deep_ep/buffers/legacy.py	672	def low_latency_update_mask_buffer(self, rank_to_mask: int, mask: bool = False):	CODE
LOW⚡	deep_ep/buffers/legacy.py	683	def low_latency_query_mask_buffer(self, mask_status: torch.Tensor):	CODE
LOW⚡	deep_ep/buffers/legacy.py	693	def low_latency_clean_mask_buffer(self):	CODE
LOW⚡	deep_ep/buffers/legacy.py	700	def get_next_low_latency_combine_buffer(self, handle: object):	CODE
LOW	deep_ep/buffers/elastic.py	409	def get_engram_storage_size_hint(num_entries: int, hidden: int,	CODE
LOW	deep_ep/buffers/elastic.py	459	def get_agrs_num_max_session_bytes(group: dist.ProcessGroup,	CODE
LOW	deep_ep/buffers/elastic.py	480	def get_agrs_buffer_size_hint(group: dist.ProcessGroup,	CODE
LOW	deep_ep/utils/envs.py	183	def check_torch_deterministic() -> None:	CODE
LOW	deep_ep/utils/envs.py	223	def check_fast_rdma_atomic_support(nic_name: str = _DEFAULT_NIC_NAME) -> bool:	CODE
LOW	deep_ep/utils/comm.py	78	def destroy_all_managed_nccl_comm() -> None:	CODE
LOW	deep_ep/utils/gate.py	116	def get_precise_unbalanced_scores(num_tokens: int, num_experts: int, num_ranks: int, num_topk: int, ratio: float):	CODE
LOW	deep_ep/utils/gate.py	148	def map_unbalanced_ratio_to_factor(num_tokens: int, num_experts: int, num_ranks: int, num_topk: int, ratio: float) -> fl	CODE
LOW	deep_ep/utils/gate.py	167	def get_random_unbalanced_scores(num_tokens: int, num_experts: int, num_ranks: int, num_topk: int, ratio: float):	CODE
LOW	deep_ep/utils/refs.py	126	def generate_pre_combine_data(src_token_global_idx: torch.Tensor,	CODE
LOW	tests/elastic/test_ep.py	295	def get_unique_and_valid_dst_count(dst_idx: torch.Tensor,	CODE
LOW	tests/legacy/test_low_latency.py	14	def simulate_failure_and_skip(rank: int, api: Literal["dispatch", "combine", "clean"], expected_masked_ranks: Set[int]):	CODE
LOW	tests/legacy/test_low_latency.py	33	def query_mask_buffer_and_check(api: Literal["dispatch", "combine", "clean"], buffer: deep_ep.Buffer, mask_status: torch	CODE

Unused Imports13 hits · 13 pts

Severity	File	Line	Context
LOW	deep_ep/__init__.py	5	CODE
LOW	deep_ep/__init__.py	88	CODE
LOW	deep_ep/__init__.py	89	CODE
LOW	deep_ep/__init__.py	89	CODE
LOW	deep_ep/__init__.py	91	CODE
LOW	deep_ep/__init__.py	91	CODE
LOW	deep_ep/__init__.py	92	CODE
LOW	deep_ep/__init__.py	92	CODE
LOW	deep_ep/__init__.py	95	CODE
LOW	deep_ep/__init__.py	95	CODE
LOW	deep_ep/buffers/elastic.py	6	CODE
LOW	deep_ep/utils/__init__.py	3	CODE
LOW	tests/elastic/test_agrs.py	2	CODE

Deep Nesting11 hits · 11 pts

Severity	File	Line	Context
LOW	deep_ep/utils/envs.py	145	CODE
LOW	deep_ep/utils/testing.py	111	CODE
LOW	deep_ep/utils/find_pkgs.py	8	CODE
LOW	tests/elastic/test_ep.py	22	CODE
LOW	tests/elastic/test_ep.py	59	CODE
LOW	tests/elastic/test_agrs.py	89	CODE
LOW	tests/legacy/test_low_latency.py	39	CODE
LOW	tests/legacy/test_intranode.py	17	CODE
LOW	tests/legacy/test_internode.py	18	CODE
LOW	tests/legacy/test_internode.py	318	CODE
LOW	tests/utils/test_gate.py	7	CODE

Over-Commented Block8 hits · 8 pts

Severity	File	Line	Snippet	Context
LOW	format.sh	1	#!/usr/bin/env bash	COMMENT
LOW	csrc/python_api.cpp	1	#include <pybind11/pybind11.h>	COMMENT
LOW	csrc/kernels/elastic/api.hpp	1	#pragma once	COMMENT
LOW	csrc/elastic/buffer.hpp	1	#pragma once	COMMENT
LOW	csrc/legacy/buffer.hpp	1	#pragma once	COMMENT
LOW	csrc/utils/system.hpp	1	#pragma once	COMMENT
LOW	csrc/utils/format.hpp	1	#pragma once	COMMENT
LOW	csrc/jit/compiler.hpp	1	#pragma once	COMMENT

AI Structural Patterns7 hits · 7 pts

Severity	File	Line	Context
LOW	deep_ep/buffers/legacy.py	33	CODE
LOW	deep_ep/buffers/legacy.py	322	CODE
LOW	deep_ep/buffers/legacy.py	458	CODE
LOW	deep_ep/buffers/elastic.py	228	CODE
LOW	deep_ep/buffers/elastic.py	855	CODE
LOW	deep_ep/buffers/elastic.py	1046	CODE
LOW	deep_ep/utils/refs.py	243	CODE

Self-Referential Comments2 hits · 6 pts

Severity	File	Line	Snippet	Context
MEDIUM	deep_ep/utils/comm.py	71	# Create a new communicator	COMMENT
MEDIUM	deep_ep/utils/gate.py	20	# Create the mask	COMMENT

Excessive Try-Catch Wrapping4 hits · 4 pts

Severity	File	Line	Snippet	Context
LOW	deep_ep/__init__.py	38	except Exception:	CODE
LOW	deep_ep/utils/envs.py	217	except Exception as e:	CODE
LOW	deep_ep/utils/envs.py	241	except Exception:	CODE
LOW	deep_ep/utils/envs.py	266	except Exception as e:	CODE

Example Usage Blocks1 hit · 2 pts

Severity	File	Line	Snippet	Context
LOW	format.sh	2	# Usage:	COMMENT

Redundant / Tautological Comments1 hit · 2 pts

Severity	File	Line	Snippet	Context
LOW	format.sh	179	# Check if there are any uncommitted changes after all formatting steps.	COMMENT

Modern Structural Boilerplate1 hit · 1 pts

Severity	File	Line	Snippet	Context
LOW	deep_ep/buffers/legacy.py	154	def set_num_sms(new_num_sms: int) -> None:	CODE

Analysis Overview

What These Metrics Mean

Score History

Severity Breakdown

Directory Score Breakdown

Pattern Findings