sgl-project/sglang

Adjusted Score: 18.2
Raw Score: 18.2
Time Factor: 100%
Last Push: 2026-07-14
Stars: 30.3K
Language: Python
Lines of Code: 1.9M
Files: 6.3K
Pattern Hits: 26.4K
Scan Date: 2026-07-14
HC Hit Rate: 0.08

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.
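
The relationships among these metrics can be sketched in a few lines of Python. This is a minimal illustration that applies the definitions above literally; the function names are hypothetical, and the scanner's actual implementation is not shown in this report. The HC Hit Rate example uses the HIGH and CRITICAL counts from the Severity Breakdown and the rounded file count (6.3K) from the summary; the adjusted-score inputs are made-up numbers.

```python
def adjusted_score(raw_points: float, lines_of_code: int, time_factor: float) -> float:
    """Raw points per 1,000 lines of code, scaled by the temporal discount.

    time_factor is a fraction between 0.0 and 1.0 (for example 0.05 or 1.0).
    """
    return raw_points * 1000 / lines_of_code * time_factor


def hc_hit_rate(high_hits: int, critical_hits: int, files: int) -> float:
    """High+Critical pattern hits per analysed file."""
    return (high_hits + critical_hits) / files


# Counts reported for this repository: 470 HIGH, 8 CRITICAL, about 6,300 files.
rate = hc_hit_rate(high_hits=470, critical_hits=8, files=6300)
print(round(rate, 2))  # 0.08

# Hypothetical inputs, only to show the effect of the 5% pre-ChatGPT factor.
print(adjusted_score(raw_points=500, lines_of_code=100_000, time_factor=0.05))
```

Because the file count is rounded in the summary, the rate computed here only matches the reported value after rounding to two decimals.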

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL: 8 · HIGH: 470 · MEDIUM: 2665 · LOW: 23286
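
As a rough check on how the severity coefficients combine with these counts, the sketch below computes the total hit count and the coefficient-weighted base sum. The names are hypothetical. The base sum deliberately ignores context and density multipliers, so it is not expected to equal the reported raw score.

```python
# Severity coefficients as stated in the description above.
SEVERITY_COEFFICIENTS = {"CRITICAL": 10, "HIGH": 5, "MEDIUM": 2, "LOW": 1}

# Hit counts reported for this scan.
hit_counts = {"CRITICAL": 8, "HIGH": 470, "MEDIUM": 2665, "LOW": 23286}


def base_weighted_points(counts, coefficients=SEVERITY_COEFFICIENTS):
    """Sum of hits times severity coefficient, before any multipliers."""
    return sum(coefficients[severity] * n for severity, n in counts.items())


total_hits = sum(hit_counts.values())
base_points = base_weighted_points(hit_counts)

print(total_hits)   # 26429
print(base_points)  # 31046
```

The total of 26429 agrees with the Pattern Hits figure (26.4K) in the summary.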

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.
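
The per-directory decomposition can be approximated by grouping finding scores on the first path component. The sketch below is illustrative only: the function name is hypothetical, the point values are invented, and only the file paths are taken from the findings listed in this report.

```python
from collections import defaultdict


def score_by_top_level_directory(findings):
    """Sum points per top-level directory, highest total first.

    findings is an iterable of (path, points) pairs. Files at the
    repository root are grouped under ".".
    """
    totals = defaultdict(float)
    for path, points in findings:
        top_level = path.split("/", 1)[0] if "/" in path else "."
        totals[top_level] += points
    return dict(sorted(totals.items(), key=lambda item: item[1], reverse=True))


# Paths appear in this report; the point values are made up for illustration.
sample_findings = [
    ("benchmark/ocr/bench_sglang.py", 3.0),
    ("benchmark/ocr/eval_utils.py", 3.0),
    ("test/manual/test_ray_engine.py", 1.0),
    ("python/sglang/test/kits/mmmu_vlm_kit.py", 5.0),
]

for directory, points in score_by_top_level_directory(sample_findings).items():
    print(f"{directory}: {points}")
```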

Pattern Findings

The scanner identified 26429 distinct pattern matches across 25 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.
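
The weighting rules in the paragraph above can be sketched as follows. Several details are assumptions rather than documented behaviour: that a "10-line window" means the first and last match of a cluster are fewer than 10 lines apart, that clustering is evaluated per file, and that the comment and density multipliers stack multiplicatively. All names are hypothetical.

```python
from dataclasses import dataclass

SEVERITY_COEFFICIENTS = {"CRITICAL": 10, "HIGH": 5, "MEDIUM": 2, "LOW": 1}
COMMENT_WEIGHT = 1.5        # comment-context multiplier from the text above
CLUSTER_WEIGHT = 1.5        # density multiplier for clustered matches
CLUSTER_MIN_MATCHES = 3     # "three or more patterns"
CLUSTER_WINDOW_LINES = 10   # assumed: last line minus first line < 10


@dataclass(frozen=True)
class Match:
    severity: str
    line: int
    context: str  # "CODE", "COMMENT", or "STRING"


def clustered_flags(matches):
    """Return one bool per match (same order): True if it is in a cluster.

    All matches are assumed to come from a single file.
    """
    order = sorted(range(len(matches)), key=lambda i: matches[i].line)
    lines = [matches[i].line for i in order]
    flags = [False] * len(matches)
    start = 0
    for end in range(len(lines)):
        # Shrink the window until it spans fewer than 10 lines.
        while lines[end] - lines[start] >= CLUSTER_WINDOW_LINES:
            start += 1
        if end - start + 1 >= CLUSTER_MIN_MATCHES:
            for k in range(start, end + 1):
                flags[order[k]] = True
    return flags


def match_points(match, clustered):
    """Severity coefficient times the applicable multipliers."""
    points = float(SEVERITY_COEFFICIENTS[match.severity])
    if match.context == "COMMENT":
        points *= COMMENT_WEIGHT
    if clustered:
        points *= CLUSTER_WEIGHT
    return points


# Line numbers of eight separator comments listed for benchmark/ocr/bench_sglang.py.
matches = [Match("MEDIUM", n, "COMMENT") for n in (54, 56, 67, 69, 367, 369, 372, 374)]
flags = clustered_flags(matches)
total = sum(match_points(m, f) for m, f in zip(matches, flags))
print(flags)
print(total)
```

Under these assumptions, the four matches at lines 367 to 374 are flagged as clustered and the four at lines 54 to 69 are not, which agrees with the bolt markers on those rows in the table below. The real scanner may also count matches from other categories when forming clusters.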

Hyper-Verbose Identifiers: 13875 hits · 13623 pts

Severity	File	Line	Snippet	Context
LOW⚡	benchmark/dspy/bench_dspy_intro.py	136	def validate_context_and_answer(example, pred, trace=None):	CODE
LOW	…ing_window_attention_triton/bench_triton_swa_kernel.py	11	def extend_attention_fwd_torch(	CODE
LOW	benchmark/kernels/all_gather/benchmark_aiter.py	175	def expected_logical_all_gather(	CODE
LOW	benchmark/kernels/all_gather/benchmark_aiter.py	230	def install_aiter_aot_import_shims() -> None:	CODE
LOW⚡	…shinfer_allreduce_fusion/benchmark_fused_collective.py	122	def cleanup_flashinfer_workspace(ipc_handles):	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	85	def setup_flashinfer_workspace(	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	153	def get_trtllm_fused_allreduce_kwargs(self):	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	163	def flashinfer_fused_allreduce_rmsnorm(	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	288	def standard_allreduce_rmsnorm(	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	398	def standard_allreduce_rmsnorm_native(	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	479	def standard_allreduce_rmsnorm_native_compiled(	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	895	def prepare_results_with_speedups(results_dict):	CODE
LOW	…uler_batch/benchmark_write_req_to_token_pool_triton.py	12	def write_req_to_token_pool_triton(	CODE
LOW	…uler_batch/benchmark_write_req_to_token_pool_triton.py	49	def write_req_to_token_pool_triton_optimize(	CODE
LOW	…uler_batch/benchmark_write_req_to_token_pool_triton.py	91	def write_req_to_token_pool_reference(	CODE
LOW	…uler_batch/benchmark_write_req_to_token_pool_triton.py	114	def test_write_req_to_token_pool():	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	24	def calculate_shard_intermediate_size(	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	209	def get_rocm_configs_compute_bound() -> List[Dict[str, int]]:	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	232	def get_configs_compute_bound() -> List[Dict[str, int]]:	CODE
LOW	…pseek/benchmark_deepgemm_dsv3_router_gemm_blackwell.py	137	def get_benchmark_plot_friendly(tp_sizes):	CODE
LOW	…nels/deepseek/benchmark_deepgemm_fp8_gemm_blackwell.py	231	def get_benchmark_plot_friendly(tp_size):	CODE
LOW	…hmark/kernels/quantization/tuning_block_wise_kernel.py	148	def get_rocm_configs_compute_bound():	CODE
LOW	…hmark/kernels/quantization/tuning_block_wise_kernel.py	171	def get_configs_compute_bound():	CODE
LOW	…s/decoding_attention_triton/triton_flashinfer_cudnn.py	98	def decode_attention_flashinfer(dtype, head_num_q, head_num_kv):	CODE
LOW	benchmark/mmmu/bench_sglang.py	127	async def process_sample_with_semaphore(	CODE
LOW	benchmark/mmmu/eval_utils.py	292	def _parse_explicit_multi_choice_answer(response, all_choices):	CODE
LOW	benchmark/mmmu/eval_utils.py	303	def parse_multi_choice_response(response, all_choices, index2ans):	CODE
LOW	…hmark/bench_pynccl_allocator/bench_segment_tracking.py	56	def bench_register_segments_with_comm(	CODE
LOW	…hmark/bench_pynccl_allocator/bench_segment_tracking.py	114	def bench_with_various_segment_counts(	CODE
LOW	benchmark/ocr/eval_utils.py	440	def eval_math_formula_accuracy(test: dict, ocr_output: str) -> bool:	CODE
LOW	benchmark/scheduler/bench_token_storage.py	84	def _batch_tensor_from_pyarrays(parts: list[array]) -> torch.Tensor:	CODE
LOW	benchmark/scheduler/bench_token_storage.py	256	def microbench_torch_tensor_paths(	CODE
LOW	benchmark/hicache/bench_mix.py	299	async def async_request_sglang_generate(	CODE
LOW	benchmark/hicache/bench_warm_cache.py	90	def _create_bench_client_session() -> aiohttp.ClientSession:	CODE
LOW	benchmark/hicache/bench_warm_cache.py	97	async def async_request_sglang_generate(	CODE
LOW	benchmark/hicache/bench_warm_cache.py	413	def maybe_write_summary_jsonl(	CODE
LOW	benchmark/hicache/bench_warm_cache.py	472	async def benchmark_shared_prefix_pct(	CODE
LOW	benchmark/hicache/bench_serving.py	71	async def async_request_openai_completions(	CODE
LOW	benchmark/hicache/data_processing.py	150	def sample_ultrachat_requests(	CODE
LOW	benchmark/hicache/data_processing.py	434	def sample_generated_shared_prefix_requests(	CODE
LOW	benchmark/hicache/bench_hicache_write_back.py	69	def _validate_mha_correctness(	CODE
LOW	benchmark/hicache/bench_hicache_write_back.py	88	def _validate_mla_correctness(	CODE
LOW	benchmark/prefill_only/bench_score.py	65	def create_score_request_builder():	CODE
LOW	benchmark/prefill_only/bench_score.py	80	def generate_text_with_token_count_local(num_toks):	CODE
LOW	benchmark/prefill_only/bench_score.py	120	def build_warmup_score_request() -> dict:	CODE
LOW	benchmark/prefill_only/util.py	46	def generate_text_with_token_count(	CODE
LOW	benchmark/prefill_only/util.py	106	def prepare_all_requests_parallel(	CODE
LOW	benchmark/prefill_only/util.py	398	async def perform_global_warmup_and_freeze(	CODE
LOW⚡	benchmark/prefill_only/bench_embeddings.py	100	def validate_embeddings_response(response_data: dict) -> bool:	CODE
LOW⚡	benchmark/prefill_only/bench_embeddings.py	110	def build_warmup_embeddings_request() -> dict:	CODE
LOW	…nchmark/bench_in_batch_prefix/bench_in_batch_prefix.py	76	def test_batch_by_batch_with_hint(all_prompts, gen_len):	CODE
LOW	benchmark/asr/bench_sglang.py	55	def run_asr_transcription_sync(client, model_name, y, sr, language=None):	CODE
LOW	benchmark/asr/bench_sglang.py	75	def run_asr_transcription_stream_sync(	CODE
LOW	benchmark/benchmark_batch/benchmark_tokenizer.py	179	def generate_random_token_ids(*, num_prompts, num_tokens, tokenizer):	CODE
LOW	benchmark/lora/lora_bench.py	48	async def async_request_openai_completions(	CODE
LOW	…ark/bench_linear_attention/bench_cutedsl_kda_decode.py	155	def run_prefill_then_decode_baseline(inp):	CODE
LOW	…ark/bench_linear_attention/bench_cutedsl_kda_decode.py	191	def run_prefill_then_decode_cutedsl(inp):	CODE
LOW	benchmark/generative_agents/agent_functions.py	56	def generate_event_triple_prompt(persona_name, action):	CODE
LOW	benchmark/generative_agents/agent_functions.py	89	def generate_pronunciatio_prompt(action):	CODE
LOW	benchmark/generative_agents/agent_functions.py	158	def action_location_sector_prompt(	CODE
13815 more matches not shown…

Decorative Section Separators: 2144 hits · 6986 pts

Severity	File	Line	Snippet	Context
MEDIUM	…hmark/kernels/quantization/tuning_block_wise_kernel.py	13	# ==============================================================================	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	128	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	130	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	181	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	183	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	333	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	335	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	390	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	392	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	benchmark/ocr/bench_sglang.py	367	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	benchmark/ocr/bench_sglang.py	369	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	benchmark/ocr/bench_sglang.py	372	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	benchmark/ocr/bench_sglang.py	374	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	54	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	56	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	67	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	69	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	86	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	88	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	214	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	216	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	263	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	265	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	310	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	312	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	565	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	567	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	625	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/bench_sglang.py	627	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	30	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	32	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	75	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	77	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	273	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	275	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/ocr/generate_report.py	322	# ---------------------------------------------------------------------------	STRING
MEDIUM	benchmark/ocr/generate_report.py	324	# ---------------------------------------------------------------------------	STRING
MEDIUM	benchmark/ocr/eval_utils.py	22	# ── Unicode normalization ─────────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	50	# ── Matching helpers ──────────────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	96	# ── olmOCR-bench test evaluators ─────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	190	# ── Table parsing helpers ─────────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	291	# ── 1. Structured tables (HTML <tr>/<td> or Markdown) ────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	342	# ── 2. Flat <table>…</table> fallback (DeepSeek-OCR-2 format) ────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	429	# Math formula evaluation ─────────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	467	# ── Main dispatcher ───────────────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	510	# ── Aggregation & reporting ───────────────────────────────────────────────────	COMMENT
MEDIUM	benchmark/ocr/eval_utils.py	602	# ── Normalized Edit Distance (OmniDocBench-style text quality metric) ─────────	COMMENT
MEDIUM	benchmark/lora/lora_bench.py	13	# ==============================================================================	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	39	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	41	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	89	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	91	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	155	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	157	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	206	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	208	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	299	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	benchmark/bench_linear_attention/bench_gdn_decode.py	301	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…rk/bench_linear_attention/bench_kda_prefill_cutedsl.py	49	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	…rk/bench_linear_attention/bench_kda_prefill_cutedsl.py	51	# ---------------------------------------------------------------------------	COMMENT
2084 more matches not shown…

Unused Imports: 2672 hits · 2496 pts

Severity	File	Line	Context
LOW	benchmark/kernels/all_gather/benchmark_aiter.py	16	CODE
LOW	…ls/deepseek/benchmark_cute_dsl_fp8_paged_mqa_logits.py	3	CODE
LOW	benchmark/scheduler/bench_token_storage.py	25	CODE
LOW	benchmark/hicache/perf.py	1	CODE
LOW	benchmark/hicache/bench_hicache_write_back.py	1	CODE
LOW	benchmark/bench_linear_attention/bench_gdn_qkv_split.py	1	CODE
LOW	…router/tests/scripts/generate_kv_events_hash_parity.py	48	CODE
LOW	…perimental/sgl-router/tests/e2e/test_tokenize_smoke.py	5	CODE
LOW	experimental/sgl-router/tests/e2e/conftest.py	21	CODE
LOW	experimental/sgl-router/tests/e2e/test_chat_smoke.py	5	CODE
LOW	experimental/sgl-router/tests/e2e/test_chat_smoke.py	8	CODE
LOW	experimental/sgl-router/tests/e2e/infra/gateway.py	25	CODE
LOW	experimental/sgl-router/tests/e2e/infra/model_specs.py	16	CODE
LOW	experimental/sgl-router/tests/e2e/infra/model_pool.py	19	CODE
LOW	…al/sgl-router/tests/e2e/k8s_integration/fake_worker.py	10	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_lifecycle.py	14	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_lifecycle.py	16	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_lifecycle.py	19	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_lifecycle.py	20	CODE
LOW	…ental/sgl-router/tests/e2e/k8s_integration/conftest.py	12	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_discovery.py	8	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_discovery.py	11	CODE
LOW	…sgl-router/tests/e2e/k8s_integration/test_discovery.py	12	CODE
LOW	…uter/tests/e2e/k8s_integration/test_cross_namespace.py	15	CODE
LOW	…uter/tests/e2e/k8s_integration/test_cross_namespace.py	18	CODE
LOW	…outer/tests/e2e/k8s_integration/test_reconciliation.py	16	CODE
LOW	…outer/tests/e2e/k8s_integration/test_reconciliation.py	18	CODE
LOW	…l-router/tests/e2e/chat_completions/test_validation.py	14	CODE
LOW	…r/tests/e2e/chat_completions/test_load_based_policy.py	6	CODE
LOW	…ts/e2e/chat_completions/test_two_router_convergence.py	38	CODE
LOW	test/manual/test_ray_engine.py	25	CODE
LOW	test/manual/quant/test_block_fp8.py	679	CODE
LOW	test/manual/layers/moe/bench_mxfp4_sm90_kernels.py	21	CODE
LOW	test/manual/kv_canary/test_self_e2e_baseline_dsv4.py	1	CODE
LOW	test/manual/hicache/test_pp_with_hicache.py	79	CODE
LOW	test/registered/kernels/test_gemma4_fused_routing.py	12	CODE
LOW	test/registered/kernels/test_sm120_paged_mqa_logits.py	18	CODE
LOW	test/registered/kernels/test_flash_mla_backends.py	22	CODE
LOW	test/registered/attention/test_kda_prefill_flashkda.py	34	CODE
LOW	…ion/unittests/dual_chunk/test_dual_chunk_flash_attn.py	50	CODE
LOW	…ed/unit/layers/quantization/test_mxfp4_sm90_cutlass.py	14	CODE
LOW	…ed/unit/layers/quantization/test_mxfp4_sm90_cutlass.py	174	CODE
LOW	…unit/hardware_backend/mlx/test_runner_init_contract.py	8	CODE
LOW	…tered/unit/hardware_backend/mlx/test_mlx_pool_dtype.py	17	CODE
LOW	…hardware_backend/mlx/test_mlx_quantization_override.py	7	CODE
LOW	…d/unit/hardware_backend/mlx/test_attention_patching.py	3	CODE
LOW	…ered/unit/hardware_backend/mlx/test_scheduler_mixin.py	14	CODE
LOW	…istered/unit/hardware_backend/mlx/test_quantization.py	13	CODE
LOW	…hardware_backend/mlx/test_mlx_reference_correctness.py	36	CODE
LOW	…ed/unit/hardware_backend/mlx/test_tp_worker_routing.py	28	CODE
LOW	…tered/unit/hardware_backend/mlx/test_metal_profiler.py	14	CODE
LOW	…/hardware_backend/mlx/test_mlx_runner_pool_contract.py	13	CODE
LOW	test/registered/unit/distributed/test_parallel_state.py	37	CODE
LOW	test/registered/unit/distributed/test_vmm_utils.py	12	CODE
LOW	…stered/unit/managers/test_customized_info_streaming.py	1	CODE
LOW	…t/registered/unit/scripted_runtime/test_http_server.py	1	CODE
LOW	…red/unit/scripted_runtime/test_tokenizer_recv_proxy.py	1	CODE
LOW	…d/unit/scripted_runtime/test_background_http_poster.py	1	CODE
LOW	…d/unit/scripted_runtime/test_scripted_runtime_utils.py	1	CODE
LOW	…egistered/unit/scripted_runtime/test_scheduler_hook.py	1	CODE
2612 more matches not shown…

Cross-File Repetition: 399 hits · 1995 pts

Severity	File	Line	Snippet	Context
HIGH	…hmark/kernels/quantization/tuning_block_wise_kernel.py	0	this function performs matrix multiplication with block-wise quantization. it takes two input tensors `a` and `b` with s	STRING
HIGH	python/sglang/kernels/ops/quantization/int8_kernel.py	0	this function performs matrix multiplication with block-wise quantization. it takes two input tensors `a` and `b` with s	STRING
HIGH	python/sglang/kernels/ops/quantization/fp8_kernel.py	0	this function performs matrix multiplication with block-wise quantization. it takes two input tensors `a` and `b` with s	STRING
HIGH	benchmark/tree_of_thought_v0/bench_sglang.py	0	please generate a high-level plan for solving the following question. as the first step, just say what method and idea y	STRING
HIGH	benchmark/tree_of_thought_v0/bench_other.py	0	please generate a high-level plan for solving the following question. as the first step, just say what method and idea y	STRING
HIGH	benchmark/tree_of_thought_deep/bench_sglang.py	0	please generate a high-level plan for solving the following question. as the first step, just say what method and idea y	STRING
HIGH	benchmark/tree_of_thought_deep/lmql_funcs.py	0	please generate a high-level plan for solving the following question. as the first step, just say what method and idea y	STRING
HIGH	benchmark/tree_of_thought_deep/bench_other.py	0	please generate a high-level plan for solving the following question. as the first step, just say what method and idea y	STRING
HIGH	benchmark/tree_of_thought_v0/bench_sglang.py	0	the plan looks good! now, use real numbers and do the calculation. please solve the question step-by-step according to t	STRING
HIGH	benchmark/tree_of_thought_v0/bench_other.py	0	the plan looks good! now, use real numbers and do the calculation. please solve the question step-by-step according to t	STRING
HIGH	benchmark/tree_of_thought_deep/bench_sglang.py	0	the plan looks good! now, use real numbers and do the calculation. please solve the question step-by-step according to t	STRING
HIGH	benchmark/tree_of_thought_deep/lmql_funcs.py	0	the plan looks good! now, use real numbers and do the calculation. please solve the question step-by-step according to t	STRING
HIGH	benchmark/tree_of_thought_deep/bench_other.py	0	the plan looks good! now, use real numbers and do the calculation. please solve the question step-by-step according to t	STRING
HIGH	benchmark/tree_of_thought_v0/bench_sglang.py	0	okay. now, evaluate your own solution and give it a score on a scale of 1 to 5. please do rigorous check of the correctn	STRING
HIGH	benchmark/tree_of_thought_v0/bench_other.py	0	okay. now, evaluate your own solution and give it a score on a scale of 1 to 5. please do rigorous check of the correctn	STRING
HIGH	benchmark/tree_of_thought_deep/bench_sglang.py	0	okay. now, evaluate your own solution and give it a score on a scale of 1 to 5. please do rigorous check of the correctn	STRING
HIGH	benchmark/tree_of_thought_deep/lmql_funcs.py	0	okay. now, evaluate your own solution and give it a score on a scale of 1 to 5. please do rigorous check of the correctn	STRING
HIGH	benchmark/tree_of_thought_deep/bench_other.py	0	okay. now, evaluate your own solution and give it a score on a scale of 1 to 5. please do rigorous check of the correctn	STRING
HIGH	benchmark/tip_suggestion/bench_sglang.py	0	please expand a tip for a topic into a detailed paragraph. topic: staying healthy tip: regular exercise paragraph: incor	STRING
HIGH	benchmark/tip_suggestion/lmql_funcs.py	0	please expand a tip for a topic into a detailed paragraph. topic: staying healthy tip: regular exercise paragraph: incor	STRING
HIGH	benchmark/tip_suggestion/bench_other.py	0	please expand a tip for a topic into a detailed paragraph. topic: staying healthy tip: regular exercise paragraph: incor	STRING
HIGH	benchmark/json_jump_forward/bench_sglang.py	0	"house": "(gryffindor\|slytherin\|ravenclaw\|hufflepuff)",\n	STRING
HIGH	benchmark/json_jump_forward/bench_other.py	0	"house": "(gryffindor\|slytherin\|ravenclaw\|hufflepuff)",\n	STRING
HIGH	examples/frontend_language/usage/json_decode.py	0	"house": "(gryffindor\|slytherin\|ravenclaw\|hufflepuff)",\n	STRING
HIGH	benchmark/json_jump_forward/bench_sglang.py	0	"blood status": "(pure-blood\|half-blood\|muggle-born)",\n	STRING
HIGH	benchmark/json_jump_forward/bench_other.py	0	"blood status": "(pure-blood\|half-blood\|muggle-born)",\n	STRING
HIGH	examples/frontend_language/usage/json_decode.py	0	"blood status": "(pure-blood\|half-blood\|muggle-born)",\n	STRING
HIGH	benchmark/json_jump_forward/bench_sglang.py	0	"occupation": "(student\|teacher\|auror\|ministry of magic\|death eater\|order of the phoenix)",\n	STRING
HIGH	benchmark/json_jump_forward/bench_other.py	0	"occupation": "(student\|teacher\|auror\|ministry of magic\|death eater\|order of the phoenix)",\n	STRING
HIGH	examples/frontend_language/usage/json_decode.py	0	"occupation": "(student\|teacher\|auror\|ministry of magic\|death eater\|order of the phoenix)",\n	STRING
HIGH	benchmark/tree_of_thought_deep/bench_sglang.py	0	based on your reflection, do you change your mind? now, give me the final answer after careful consideration.	STRING
HIGH	benchmark/tree_of_thought_deep/lmql_funcs.py	0	based on your reflection, do you change your mind? now, give me the final answer after careful consideration.	STRING
HIGH	benchmark/tree_of_thought_deep/bench_other.py	0	based on your reflection, do you change your mind? now, give me the final answer after careful consideration.	STRING
HIGH	…ph/test_piecewise_cuda_graph_support_1_gpu_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/quant/test_nvfp4_gemm_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/quant/test_awq_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/distributed/test_dp_attention_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	…/manual/models/test_nvidia_nemotron_3_nano_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/mla/test_mla_flashinfer_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/mla/test_flashmla_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/mla/test_mla_int8_deepseek_v3_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	…al/4-gpu-models/test_qwen3_next_models_mtp_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/4-gpu-models/test_qwen35_models_archived.py	0	archived test classes split out of test/registered/4-gpu-models/test_qwen35_models.py. originally registered with `regis	STRING
HIGH	test/manual/quant/test_block_fp8.py	0	this function performs matrix multiplication with block-wise quantization using native torch. it takes two input tensors	STRING
HIGH	test/manual/quant/test_block_fp8_deep_gemm_blackwell.py	0	this function performs matrix multiplication with block-wise quantization using native torch. it takes two input tensors	STRING
HIGH	test/registered/quant/test_block_int8.py	0	this function performs matrix multiplication with block-wise quantization using native torch. it takes two input tensors	STRING
HIGH	test/manual/nightly/test_vlms_vit_cuda_graph.py	0	evaluate a vlm on the mmmu validation set with lmms-eval. only `model_version` (checkpoint) and `chat_template` vary; we	STRING
HIGH	test/manual/nightly/test_vlms_vit_flashinfer_cudnn.py	0	evaluate a vlm on the mmmu validation set with lmms-eval. only `model_version` (checkpoint) and `chat_template` vary; we	STRING
HIGH	test/manual/nightly/test_vlms_piecewise_cuda_graph.py	0	evaluate a vlm on the mmmu validation set with lmms-eval. only `model_version` (checkpoint) and `chat_template` vary; we	STRING
HIGH	python/sglang/test/kits/mmmu_vlm_kit.py	0	evaluate a vlm on the mmmu validation set with lmms-eval. only `model_version` (checkpoint) and `chat_template` vary; we	STRING
HIGH	test/manual/nightly/test_vlms_vit_cuda_graph.py	0	common method to run vlm mmmu benchmark test. args: model: model to test output_path: path for output logs test_name: op	STRING
HIGH	test/manual/nightly/test_vlms_vit_flashinfer_cudnn.py	0	common method to run vlm mmmu benchmark test. args: model: model to test output_path: path for output logs test_name: op	STRING
HIGH	test/manual/nightly/test_vlms_piecewise_cuda_graph.py	0	common method to run vlm mmmu benchmark test. args: model: model to test output_path: path for output logs test_name: op	STRING
HIGH	python/sglang/test/kits/mmmu_vlm_kit.py	0	common method to run vlm mmmu benchmark test. args: model: model to test output_path: path for output logs test_name: op	STRING
HIGH	test/manual/models/test_nvidia_nemotron_nano_v2_vl.py	0	`--limit=0.1`: 10 percent of each task - this is fine for testing since the nominal result isn't interesting - this run	STRING
HIGH	test/registered/models_e2e/test_ministral3_models.py	0	`--limit=0.1`: 10 percent of each task - this is fine for testing since the nominal result isn't interesting - this run	STRING
HIGH	test/registered/models_e2e/test_ministral4_models.py	0	`--limit=0.1`: 10 percent of each task - this is fine for testing since the nominal result isn't interesting - this run	STRING
HIGH	test/registered/quant/test_int8_kernel.py	0	matrix multiplication function that supports per-token input quantization and per-column weight quantization	STRING
HIGH	test/registered/cpu/utils.py	0	matrix multiplication function that supports per-token input quantization and per-column weight quantization	STRING
HIGH	…t/registered/moe/test_triton_moe_channel_fp8_kernel.py	0	matrix multiplication function that supports per-token input quantization and per-column weight quantization	STRING
339 more matches not shown…

Excessive Try-Catch Wrapping · 1707 hits · 1975 pts

Severity	File	Line	Snippet	Context
LOW	benchmark/bench_adaptive_speculative.py	70	except Exception as e:	CODE
LOW	benchmark/kernels/all_gather/benchmark_aiter.py	365	except Exception:	CODE
LOW⚡	…shinfer_allreduce_fusion/benchmark_fused_collective.py	117	except Exception as e:	CODE
LOW⚡	…shinfer_allreduce_fusion/benchmark_fused_collective.py	130	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	47	except Exception: # pragma: no cover - fallback on non-supported platforms	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	651	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	665	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	684	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	701	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	721	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	738	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	763	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	788	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	812	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	829	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	855	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	883	except Exception as e:	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	1084	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_aiter.py	227	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_aiter.py	237	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_aiter.py	253	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_aiter.py	265	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_aiter.py	305	except Exception:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_all_reduce.py	245	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_all_reduce.py	255	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_all_reduce.py	274	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_all_reduce.py	284	except Exception as e:	CODE
LOW	benchmark/kernels/all_reduce/benchmark_all_reduce.py	324	except Exception:	CODE
LOW	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	228	except Exception:	CODE
LOW	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	305	except Exception:	CODE
LOW	benchmark/mmmu/bench_sglang.py	64	except Exception:	CODE
LOW	benchmark/mmmu/bench_hf.py	35	except Exception as first_exception:	CODE
LOW	benchmark/mmmu/bench_hf.py	60	except Exception as second_exception:	CODE
LOW	benchmark/mmmu/eval_utils.py	195	except Exception as exc:	CODE
MEDIUM	benchmark/mmmu/eval_utils.py	514	print(f"Error to extract answer from: {pred_i}")	CODE
LOW	benchmark/json_schema/bench_sglang.py	104	except Exception as e:	CODE
MEDIUM	…hmark/bench_pynccl_allocator/bench_segment_tracking.py	195	print("Error: CUDA is not available. This benchmark requires a GPU.")	CODE
LOW⚡	benchmark/ocr/bench_sglang.py	363	except Exception:	CODE
LOW	benchmark/ocr/bench_sglang.py	449	except Exception as exc:	CODE
LOW	benchmark/ocr/bench_sglang.py	460	except Exception as exc:	CODE
LOW	benchmark/ocr/bench_sglang.py	686	except Exception as exc:	CODE
LOW	benchmark/ocr/eval_utils.py	503	except Exception as exc:	CODE
LOW	benchmark/hicache/bench_mix.py	362	except Exception as e:	CODE
LOW	benchmark/hicache/bench_mix.py	423	except Exception as e:	CODE
MEDIUM	benchmark/hicache/bench_mix.py	479	print(f"Error processing response for client {user_data}: {e}")	CODE
MEDIUM	benchmark/hicache/bench_mix.py	417	def handle_request(self, user_data):	CODE
LOW	benchmark/hicache/bench_warm_cache.py	169	except Exception as exc:	CODE
LOW	benchmark/hicache/bench_serving.py	194	except Exception:	CODE
LOW	benchmark/hicache/bench_serving.py	214	except Exception:	CODE
LOW	benchmark/hicache/bench_serving.py	273	except Exception as e:	CODE
MEDIUM	benchmark/hicache/bench_serving.py	632	print(f"Error running benchmark for request rate: {request_rate}")	CODE
LOW	benchmark/hicache/bench_serving.py	739	except Exception as e:	CODE
MEDIUM	benchmark/hicache/bench_multiturn.py	180	print(f"Error writing to JSONL file: {e}")	CODE
LOW	benchmark/hicache/bench_multiturn.py	395	except Exception as e:	CODE
MEDIUM	benchmark/hicache/bench_multiturn.py	543	print(f"Error processing response for client {client_id}: {e}")	CODE
LOW	benchmark/hicache/bench_multiturn.py	552	except Exception as e:	CODE
LOW	benchmark/prefill_only/bench_score.py	108	except Exception as e:	CODE
MEDIUM	benchmark/prefill_only/bench_score.py	109	print(f"Error building request {index}: {e}")	CODE
LOW	benchmark/prefill_only/util.py	131	except Exception as e:	CODE
MEDIUM	benchmark/prefill_only/util.py	132	print(f"Error building request {index}: {e}")	CODE
1647 more matches not shown…

Deep Nesting · 1746 hits · 1510 pts

Severity	File	Line	Context
LOW	benchmark/kernels/all_gather/benchmark_aiter.py	290	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	606	CODE
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	895	CODE
LOW	…hmark/kernels/all_reduce/benchmark_fused_ar_rms_amd.py	367	CODE
LOW	…/fused_moe_triton/benchmark_torch_compile_fused_moe.py	16	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	33	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	209	CODE
LOW	benchmark/kernels/fused_moe_triton/common_utils.py	232	CODE
LOW	…ls/deepseek/benchmark_cute_dsl_fp8_paged_mqa_logits.py	89	CODE
LOW	…hmark/kernels/quantization/tuning_block_wise_kernel.py	148	CODE
LOW	…hmark/kernels/quantization/tuning_block_wise_kernel.py	171	CODE
LOW	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	133	CODE
LOW	benchmark/kernels/lora_csgmv/tune_lora_csgmv.py	151	CODE
LOW	benchmark/kernels/deepep/deepep_utils.py	157	CODE
LOW	benchmark/kernels/deepep/tuning_deepep.py	30	CODE
LOW	…s/decoding_attention_triton/triton_flashinfer_cudnn.py	159	CODE
LOW	benchmark/mmmu/eval_utils.py	154	CODE
LOW	benchmark/mmmu/eval_utils.py	303	CODE
LOW	benchmark/mmmu/eval_utils.py	423	CODE
LOW	benchmark/mmmu/eval_utils.py	528	CODE
LOW	benchmark/mmmu/eval_utils.py	430	CODE
LOW	benchmark/json_schema/bench_sglang.py	29	CODE
LOW	benchmark/ocr/bench_sglang.py	430	CODE
LOW	benchmark/ocr/generate_report.py	148	CODE
LOW	benchmark/ocr/eval_utils.py	263	CODE
LOW	benchmark/ocr/eval_utils.py	390	CODE
LOW	benchmark/line_retrieval/bench_sglang.py	30	CODE
LOW	benchmark/hicache/bench_mix.py	299	CODE
LOW	benchmark/hicache/nextqa.py	9	CODE
LOW	benchmark/hicache/bench_warm_cache.py	97	CODE
LOW	benchmark/hicache/bench_serving.py	71	CODE
LOW	benchmark/hicache/bench_serving.py	286	CODE
LOW	benchmark/hicache/data_processing.py	37	CODE
LOW	benchmark/hicache/data_processing.py	198	CODE
LOW	benchmark/hicache/data_processing.py	267	CODE
LOW	benchmark/hicache/data_processing.py	520	CODE
LOW	benchmark/hicache/bench_multiturn.py	435	CODE
LOW	benchmark/hicache/bench_multiturn.py	555	CODE
LOW	benchmark/prefill_only/util.py	106	CODE
LOW	benchmark/prefill_only/util.py	284	CODE
LOW	benchmark/reasoning_benchmark/answer_extraction.py	8	CODE
LOW	benchmark/reasoning_benchmark/answer_extraction.py	182	CODE
LOW	benchmark/reasoning_benchmark/answer_extraction.py	214	CODE
LOW	benchmark/reasoning_benchmark/eval_utils.py	59	CODE
LOW	benchmark/asr/bench_sglang.py	75	CODE
LOW	benchmark/json_jump_forward/bench_other.py	132	CODE
LOW	benchmark/lora/lora_bench.py	48	CODE
LOW	benchmark/bench_linear_attention/bench_gdn_decode.py	359	CODE
LOW	benchmark/bench_linear_attention/bench_gdn_prefill.py	520	CODE
LOW	experimental/sgl-router/tests/e2e/infra/gateway.py	234	CODE
LOW	experimental/sgl-router/tests/e2e/infra/gateway.py	348	CODE
LOW	experimental/sgl-router/tests/e2e/infra/model_pool.py	95	CODE
LOW	…outer/tests/e2e/k8s_integration/test_reconciliation.py	85	CODE
LOW	…l-router/tests/e2e/chat_completions/test_validation.py	62	CODE
LOW	test/manual/test_quick_allreduce.py	106	CODE
LOW	test/manual/test_quick_allreduce.py	274	CODE
LOW	test/manual/test_triton_attention_rocm_mla.py	224	CODE
LOW	test/manual/test_logprobs.py	99	CODE
LOW	test/manual/test_logprobs.py	302	CODE
LOW	test/manual/test_triton_moe_wna16.py	16	CODE
1686 more matches not shown…

Modern Structural Boilerplate · 1019 hits · 1041 pts

Severity	File	Line	Snippet	Context
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	54	logger = logging.getLogger(__name__)	CODE
LOW	benchmark/hicache/bench_mix.py	20	logger = logging.getLogger(__name__)	CODE
LOW	benchmark/prefill_only/bench_embeddings.py	35	logger = logging.getLogger(__name__)	CODE
LOW	experimental/sgl-router/tests/e2e/conftest.py	37	logger = logging.getLogger(__name__)	CODE
LOW	experimental/sgl-router/tests/e2e/infra/gateway.py	39	logger = logging.getLogger(__name__)	CODE
LOW	experimental/sgl-router/tests/e2e/infra/model_pool.py	34	logger = logging.getLogger(__name__)	CODE
LOW	…ental/sgl-router/tests/e2e/k8s_integration/conftest.py	22	logger = logging.getLogger(__name__)	CODE
LOW	test/manual/ascend/disaggregation_utils.py	18	logger = logging.getLogger(__name__)	CODE
LOW⚡	…d/unit/hardware_backend/mlx/test_attention_patching.py	83	def _set_runner_decode_context_defaults(runner) -> None:	CODE
LOW⚡	…d/unit/hardware_backend/mlx/test_attention_patching.py	90	def _set_dummy_server_args_for_auxiliary_state_tests() -> None:	CODE
LOW	test/registered/ascend/interface/test_npu_api_encode.py	21	logger = logging.getLogger(__name__)	CODE
LOW	…tered/ascend/basic_function/quant/test_npu_gguf_moe.py	21	logger = logging.getLogger(__name__)	CODE
LOW	…tered/ascend/basic_function/quant/test_npu_gptq_moe.py	22	logger = logging.getLogger(__name__)	CODE
LOW	…/ascend/basic_function/quant/test_npu_autoround_moe.py	22	logger = logging.getLogger(__name__)	CODE
LOW	…egistered/ascend/basic_function/quant/test_npu_gguf.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…scend/basic_function/quant/test_npu_autoround_dense.py	20	logger = logging.getLogger(__name__)	CODE
LOW	…ered/ascend/reward_models/test_npu_gemma_2_27b_v0_2.py	12	logger = logging.getLogger(__name__)	CODE
LOW⚡	test/registered/observability/test_tracing.py	46	logger = logging.getLogger(__name__)	CODE
LOW	…egistered/observability/test_tracing_disaggregation.py	34	logger = logging.getLogger(__name__)	CODE
LOW	test/registered/vlm/test_vision_chunked_prefill.py	37	logger = logging.getLogger(__name__)	CODE
LOW	test/registered/amd/test_zimage_turbo.py	24	logger = logging.getLogger(__name__)	CODE
LOW	test/registered/amd/test_wan22_fp8_mla.py	26	logger = logging.getLogger(__name__)	CODE
LOW	test/registered/rl/test_return_routed_experts.py	36	logger = logging.getLogger(__name__)	CODE
LOW	test/registered/rl/test_return_indexer_topk.py	32	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernel_api_logging.py	66	def _setup_logger() -> None:	CODE
LOW	python/sglang/__init__.py	81	__all__ = [	CODE
LOW	python/sglang/_mps_stub.py	98	def set_device(device: Any) -> None: # noqa: ARG001	CODE
LOW	python/sglang/utils.py	32	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/benchmark/dspark_sps_profiler.py	23	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/benchmark/dspark_sps_profiler.py	836	def set_forced_budget_frac(*, base_url: str, frac: Optional[float]) -> None:	CODE
LOW	python/sglang/benchmark/dspark_sts_fit.py	16	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/benchmark/datasets/__init__.py	51	__all__ = [	CODE
LOW	python/sglang/kernels/__init__.py	52	__all__ = [	CODE
LOW	python/sglang/kernels/fused_op.py	109	def set_fused_op_backend(backend: Optional[KernelBackend]) -> None:	CODE
LOW	python/sglang/kernels/ops/attention/decode_attention.py	32	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/attention/__init__.py	43	__all__ = []	CODE
LOW	python/sglang/kernels/ops/attention/deepseek_v4_rope.py	10	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/attention/deepseek_v4_rope.py	263	def set_batched_rope(enabled: bool = True) -> None:	CODE
LOW	python/sglang/kernels/ops/grammar/__init__.py	26	__all__ = []	CODE
LOW	python/sglang/kernels/ops/memory/__init__.py	30	__all__ = []	CODE
LOW	python/sglang/kernels/ops/layernorm/mhc.py	15	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/layernorm/__init__.py	329	__all__ = [	CODE
LOW	python/sglang/kernels/ops/mamba/__init__.py	86	__all__ = ["causal_conv1d_fwd", "causal_conv1d_update"]	CODE
LOW	python/sglang/kernels/ops/activation/__init__.py	164	__all__ = [	CODE
LOW	python/sglang/kernels/ops/sampling/__init__.py	56	__all__ = ["top_k_renorm_probs", "top_p_renorm_probs"]	CODE
LOW	python/sglang/kernels/ops/diffusion/__init__.py	105	__all__ = [	CODE
LOW	python/sglang/kernels/ops/kvcache/__init__.py	56	__all__ = ["reshape_and_cache_flash"]	CODE
LOW	…hon/sglang/kernels/ops/kvcache/trtllm_fp8_kv_kernel.py	22	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/gemm/lora_tuning_config.py	41	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/gemm/__init__.py	111	__all__ = ["fp8_scaled_mm", "dsv3_fused_a_gemm", "dsv3_router_gemm"]	CODE
LOW	python/sglang/kernels/ops/quantization/int8_kernel.py	27	logger = logging.getLogger(__name__)	CODE
LOW	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	3011	__all__ = [	CODE
LOW	python/sglang/kernels/ops/quantization/fp8_kernel.py	119	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/quantization/__init__.py	117	__all__ = [	CODE
LOW	python/sglang/kernels/ops/speculative/__init__.py	33	__all__ = []	CODE
LOW	python/sglang/kernels/ops/communication/__init__.py	12	__all__ = []	CODE
LOW	python/sglang/kernels/ops/spatial/__init__.py	49	__all__ = ["get_sm_available", "create_greenctx_stream_by_value"]	CODE
LOW	python/sglang/kernels/ops/moe/ep_moe_kernels.py	9	logger = logging.getLogger(__name__)	CODE
LOW	python/sglang/kernels/ops/moe/__init__.py	116	__all__ = ["moe_align_block_size", "topk_softmax"]	CODE
LOW	python/sglang/test/otel_collector.py	23	logger = logging.getLogger(__name__)	CODE
959 more matches not shown…

Over-Commented Block · 1027 hits · 992 pts

Severity	File	Line	Snippet	Context
LOW	…hmark/kernels/quantization/tuning_block_wise_kernel.py	1	# Copyright 2025 SGLang Team	COMMENT
LOW	benchmark/lora/lora_bench.py	1	# Copyright 2023-2024 SGLang Team	COMMENT
LOW	…ental/sgl-router/tests/proxy/pd_bootstrap_injection.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…perimental/sgl-router/tests/proxy/graceful_shutdown.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…mental/sgl-router/tests/proxy/cache_aware_input_ids.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…perimental/sgl-router/tests/proxy/pd_pool_isolation.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/tests/proxy/chat_routing.rs	841	.await;	COMMENT
LOW	experimental/sgl-router/tests/proxy/sticky_input_ids.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/tests/proxy/common/streaming.rs	21	use http_body_util::BodyExt;	COMMENT
LOW	…imental/sgl-router/tests/component/tokenizer/parity.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…sgl-router/tests/component/policies/cache_aware_zmq.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…/tests/component/policies/kv_events_two_subscribers.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…/tests/component/policies/kv_events_two_subscribers.rs	141	elapsed < Duration::from_millis(100),	COMMENT
LOW	…/tests/component/policies/kv_events_tree_concurrent.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…uter/tests/component/policies/kv_events_hash_parity.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	…sgl-router/tests/component/workers/concurrent_state.rs	81	let snapshot = r.workers_for(&model);	COMMENT
LOW	experimental/sgl-router/tests/e2e/conftest.py	41	# Mirrors SMG's e2e_test/conftest.py sys.path setup.	COMMENT
LOW	experimental/sgl-router/tests/e2e/requirements.txt	1	httpx==0.27.2	COMMENT
LOW	…ts/e2e/chat_completions/test_two_router_convergence.py	41	import time	COMMENT
LOW	…ts/e2e/chat_completions/test_two_router_convergence.py	221	# 1. Direct-warm each worker with its own prefix. Must happen	COMMENT
LOW	experimental/sgl-router/benches/tree_lookup.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/main.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/proxy/sse.rs	21	///	COMMENT
LOW	experimental/sgl-router/src/proxy/sse.rs	41	/// The opaque `Box<dyn Send + 'static>` accepts any drop-only payload — most	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	81	#[value(name = "power_of_two")]	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	141	pub struct ModelConfig {	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	161	#[derive(Debug, Clone, Copy)]	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	201	/// (`x-sgl-decode-url`, `x-sgl-router-error-code`).	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	261	#[derive(Debug, Clone)]	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	281	/// Two operating modes, distinguished by which selector flags are set:	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	301	/// resolved [`K8sDiscoveryMode`] once, at construction in	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	381	/// `"app=sglang"`) past the identical-selector check, even though	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	441		COMMENT
LOW	experimental/sgl-router/src/config/types.rs	461	// here and let the K8s API be the syntax authority. PD	COMMENT
LOW	experimental/sgl-router/src/config/types.rs	781	/// `classify_mode`'s prefill-first ordering would funnel every slice	COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	21	/// Discovery is mutually exclusive: pass `--worker-urls` for a static	COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	41	#[arg(long)]	COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	61		COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	81	/// to `round_robin`.	COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	101	#[arg(long)]	COMMENT
LOW	experimental/sgl-router/src/config/cli.rs	121	#[arg(long, default_value_t = default_proxy_request_timeout_secs())]	COMMENT
LOW	experimental/sgl-router/src/tokenizer/chat_template.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/tokenizer/chat_template.rs	81		COMMENT
LOW	experimental/sgl-router/src/tokenizer/chat_template.rs	101	.context("compile chat template from tokenizer_config.json")?;	COMMENT
LOW	experimental/sgl-router/src/tokenizer/adapter.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/tokenizer/dsv4.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/tokenizer/dsv4.rs	21	//! system content at the front of the prompt, so a tools-carrying request	COMMENT
LOW	experimental/sgl-router/src/tokenizer/mod.rs	281	/// `😀` (`\xF0\x9F\x98\x80`) encodes to its raw byte token ids:	COMMENT
LOW	experimental/sgl-router/src/tokenizer/mod.rs	301	vec![240, 159, 152, 128],	COMMENT
LOW	experimental/sgl-router/src/health/circuit_breaker.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/health/circuit_breaker.rs	81	/// probe slot. Calling `allow()` for filtering would leak probe slots	COMMENT
LOW	experimental/sgl-router/src/health/circuit_breaker.rs	101	/// Single-lock snapshot of `(admit, state_code)` for the `/metrics`	COMMENT
LOW	experimental/sgl-router/src/discovery/types.rs	41	/// new worker becomes available, and [`DiscoveryEvent::Removed`] when it	COMMENT
LOW	experimental/sgl-router/src/discovery/types.rs	61	}	COMMENT
LOW	experimental/sgl-router/src/discovery/k8s.rs	81	///	COMMENT
LOW	experimental/sgl-router/src/discovery/k8s.rs	141	});	COMMENT
LOW	experimental/sgl-router/src/discovery/k8s.rs	221	Ok(())	COMMENT
LOW	experimental/sgl-router/src/discovery/k8s.rs	781	/// plumbing is verified, not just "some prefill registered"),	COMMENT
LOW	experimental/sgl-router/src/discovery/static_urls.rs	1	// SPDX-FileCopyrightText: Copyright (c) 2026 The SGLang Authors	COMMENT
LOW	experimental/sgl-router/src/server/error.rs	21	/// build error). `source` captures the full anyhow chain for server-side	COMMENT
967 more matches not shown…

Structural Annotation Overuse · 333 hits · 579 pts

Severity	File	Line	Snippet	Context
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	104	# Step 1: split (same as forward_decode)	COMMENT
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	107	# Step 2: view + reshape (same as forward_decode)	COMMENT
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	112	# Step 3: fused gating + recurrent update	COMMENT
LOW⚡	…mark/bench_linear_attention/bench_fused_gate_cumsum.py	79	# Step 1: gate activation using torch ops	COMMENT
LOW⚡	…mark/bench_linear_attention/bench_fused_gate_cumsum.py	87	# Step 2: chunk-local cumsum	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	38	# Step 1: Create kind cluster (idempotent)	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	50	# Step 2: Build Docker images (unless SKIP_DOCKER_BUILD=1)	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	75	# Step 3: Load images into kind	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	82	# Step 4: Apply namespace and RBAC	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	89	# Step 5: Deploy 3 fake-worker replicas behind a Service	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	144	# Step 6: Deploy sgl-router. It is configured entirely via CLI flags in	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	336	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	342	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	348	# Step 3: Extract results (simulated tokenizer output)	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	367	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	373	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	379	# Step 3: Extract results (simulated tokenizer output for cross-encoder)	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	396	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	402	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	408	# Step 3: Extract results (simulated tokenizer output)	COMMENT
LOW	test/manual/dsv4/test_fused_compress_attn_hip.py	312	# Step 1: write current tokens to state (same for both paths)	COMMENT
LOW⚡	test/registered/unit/managers/test_hisparse_unit.py	457	# Step 1: load the first TOP_K positions from host (no newest token —	COMMENT
LOW⚡	test/registered/unit/managers/test_hisparse_unit.py	466	# Step 2: half overlap (hit) + half new (miss).	COMMENT
LOW⚡	test/registered/unit/parser/test_reasoning_parser.py	726	# Step 1: Send partial end tag when not in reasoning mode	COMMENT
LOW⚡	test/registered/unit/parser/test_reasoning_parser.py	732	# Step 2: Send normal text that doesn't complete the end tag	COMMENT
LOW	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	134	# Step 1: inc_lock_ref (pop_preallocated locks the matched node)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	150	# Step 2: cache_unfinished_req (dec old lock, inc new lock)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	153	# Step 3: cache_finished_req with is_insert=True (dec lock)	COMMENT
LOW	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	182	# Step 1: inc_lock_ref on root (simulates get_new_prebuilt_batch)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	199	# Step 2: cache_unfinished_req (dec root=no-op, inc new leaf)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	202	# Step 3: cache_finished_req (dec leaf)	COMMENT
LOW⚡	test/registered/unit/mem_cache/test_mamba_unittest.py	693	# Step 1: Insert [1,2,3] to create first node	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	705	# Step 2: Insert [1,2,3,4,5,6,7] with prev_prefix_len=0 (free all matched)	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	722	# Step 3: Insert [1,2,3,4,5,6,7,8] with prev_prefix_len=2	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	740	# Step 4: Insert [1,2,3,4,5,6,7,8,9] with prev_prefix_len=8 (covers all matched)	COMMENT
LOW⚡	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	793	# Step 1: insert 1 page	COMMENT
LOW⚡	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	797	# Step 2: insert 2 pages with prev_prefix_len=0 → frees overlap of 1 page	COMMENT
LOW	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	815	# Step 3: insert 3 pages with prev_prefix_len=len(seq_2p) → nothing freed	COMMENT
LOW⚡	…v_canary/test_self_e2e_perturb_real_kv_unused_cache.py	45	# Step 1: first batch builds radix entries that will become orphans once finished.	COMMENT
LOW⚡	…v_canary/test_self_e2e_perturb_real_kv_unused_cache.py	47	# Step 2: second batch drives more forward passes so the sweep cadence fires	COMMENT
LOW	test/registered/cpu/utils.py	72	# Step 2: compute per-block max abs values → scale	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	584	# Step 1: reasoning parser	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	588	# Step 2: feed normal_text into tool call parser (like serving_chat.py does)	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	620	# Step 1: reasoning parser	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	625	# Step 2: tool call parser on normal_text	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4368	# Step 2: baseline runs the known-good attention kernel.	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4371	# Step 3: graft output. baseline sends attn_out to target with a	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4383	# Step 1: graft input. target sends its real q to baseline along	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4392	# Step 2: target runs the (suspected buggy) attention kernel —	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4396	# Step 3: graft output. baseline sends its (good) attn_out to	COMMENT
LOW	test/registered/debug_utils/test_dumper.py	4357	# Step 1: graft input. target sends its q to baseline; baseline's	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	2206	# Step 0: prefill with 2 sequences (3+2 tokens)	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	2215	# Step 1: decode (1 token per sequence)	COMMENT
LOW	…t/registered/debug_utils/comparator/test_entrypoint.py	2267	# Step 0: prefill — seq A (3 tokens) + seq B (2 tokens)	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4723	# Step 1: Create 4 target ranks where moe_tp is absent from ranks 2-3.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4726	# Step 2: _is_jointly_determined is called with parent_axes={tp, moe_tp}	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4728	# Step 3: edp remains undeclared → ValueError emitted as error record.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4825	# Step 1: 4 target ranks with TP=2, CP=2 (replicated), EDP=4.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4827	# Step 2: _is_jointly_determined is called with parent_axes={tp, cp}, child=edp.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4829	# Step 3: CP replicated picks one rank per tp group → TP concat → correct shape.	STRING
273 more matches not shown…

AI Structural Patterns · 573 hits · 538 pts

Severity	File	Line	Context
LOW	…pseek/benchmark_deepgemm_dsv3_router_gemm_blackwell.py	49	CODE
LOW	…chmark/kernels/deepseek/benchmark_deepgemm_fp8_gemm.py	156	CODE
LOW	…chmark/kernels/deepseek/benchmark_deepgemm_fp8_gemm.py	175	CODE
LOW	…nels/deepseek/benchmark_deepgemm_fp8_gemm_blackwell.py	95	CODE
LOW	…nels/deepseek/benchmark_deepgemm_fp8_gemm_blackwell.py	109	CODE
LOW	benchmark/hicache/bench_serving.py	684	CODE
LOW	benchmark/reasoning_benchmark/answer_extraction.py	211	CODE
LOW	benchmark/multi_turn_chat/long_prompt_multi_turn.py	22	CODE
LOW	benchmark/fla/benchmark_layernorm_gated.py	15	CODE
LOW	benchmark/lora/lora_bench.py	350	CODE
LOW	test/manual/openai_server/features/test_cache_report.py	68	CODE
LOW	test/manual/openai_server/features/test_cache_report.py	80	CODE
LOW	test/manual/openai_server/features/test_cache_report.py	91	CODE
LOW	…t/manual/entrypoints/http_server/test_abort_request.py	78	CODE
LOW	test/registered/kernels/test_flash_mla_backends.py	242	CODE
LOW	test/registered/attention/test_verify_splitkv.py	85	CODE
LOW	…gistered/unit/model_executor/test_pool_configurator.py	37	CODE
LOW	…t/registered/unit/sampling/test_sampling_batch_info.py	425	CODE
LOW	test/registered/unit/mem_cache/test_registry.py	22	CODE
LOW	…stered/unit/mem_cache/test_mooncake_group_semantics.py	235	CODE
LOW	…egistered/unit/mem_cache/test_hicache_file_lru_unit.py	43	CODE
LOW	…egistered/unit/mem_cache/test_hicache_file_lru_unit.py	76	CODE
LOW	…tered/unit/mem_cache/test_unified_radix_cache_bench.py	680	CODE
LOW	test/registered/kv_canary/test_self_unit_violation.py	25	CODE
LOW	…t/registered/openai_server/basic/test_openai_server.py	465	CODE
LOW	…t/registered/openai_server/basic/test_openai_server.py	501	CODE
LOW	…/openai_server/validation/test_large_max_new_tokens.py	79	CODE
LOW	…t/registered/debug_utils/comparator/test_entrypoint.py	2705	CODE
LOW	…g_utils/comparator/tensor_comparator/test_formatter.py	301	CODE
LOW	test/registered/jit/test_flash_attention_4.py	405	CODE
LOW	test/registered/jit/kv_canary/test_write_hand.py	73	CODE
LOW	test/registered/jit/kv_canary/test_pipeline_e2e.py	183	CODE
LOW	…egistered/jit/diffusion/test_fused_norm_scale_shift.py	140	CODE
LOW	…egistered/jit/diffusion/test_fused_norm_scale_shift.py	163	CODE
LOW	test/registered/perf/test_bench_serving_1gpu_part1.py	257	CODE
LOW	test/registered/bench_fn/test_benchmark_datasets_api.py	787	CODE
LOW	test/registered/rl/test_update_weights_from_disk.py	138	CODE
LOW	test/registered/rl/test_update_weights_from_disk.py	146	CODE
LOW	test/registered/rl/test_update_weights_from_disk.py	154	CODE
LOW	test/registered/rl/test_update_weights_from_tensor.py	226	CODE
LOW	test/registered/rl/test_update_weights_from_tensor.py	234	CODE
LOW	test/registered/rl/test_update_weights_from_tensor.py	247	CODE
LOW	python/sglang/benchmark/serving.py	1303	CODE
LOW	python/sglang/benchmark/one_batch_server.py	527	CODE
LOW	python/sglang/kernels/ops/attention/extend_attention.py	622	CODE
LOW	python/sglang/kernels/ops/attention/extend_attention.py	1082	CODE
LOW	python/sglang/kernels/ops/attention/verify_splitkv.py	689	CODE
LOW	…glang/kernels/ops/kvcache/trtllm_mha_graph_metadata.py	125	CODE
LOW	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	2825	CODE
LOW	python/sglang/kernels/ops/quantization/awq_triton.py	339	CODE
LOW	python/sglang/kernels/ops/moe/rocm_moe_utils.py	28	CODE
LOW	python/sglang/kernels/ops/moe/rocm_moe_utils.py	66	CODE
LOW	…hon/sglang/kernels/ops/moe/fused_moe_triton_kernels.py	714	CODE
LOW	python/sglang/kernels/ops/moe/mxfp8_moe_amd_gfx95.py	358	CODE
LOW	…ng/kernels/ops/moe/trtllm_lora_temp/virtual_experts.py	625	CODE
LOW	…ng/kernels/ops/moe/trtllm_lora_temp/virtual_experts.py	1114	CODE
LOW	python/sglang/test/test_deterministic.py	96	CODE
LOW	python/sglang/test/test_utils.py	1093	CODE
LOW	python/sglang/test/test_utils.py	1167	CODE
LOW	python/sglang/test/simple_eval_longbench_v2.py	113	CODE
513 more matches not shown…

Verbosity Indicators · 276 hits · 492 pts

Severity	File	Line	Snippet	Context
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	104	# Step 1: split (same as forward_decode)	COMMENT
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	107	# Step 2: view + reshape (same as forward_decode)	COMMENT
LOW⚡	benchmark/bench_linear_attention/bench_gdn_decode.py	112	# Step 3: fused gating + recurrent update	COMMENT
LOW⚡	…mark/bench_linear_attention/bench_fused_gate_cumsum.py	79	# Step 1: gate activation using torch ops	COMMENT
LOW⚡	…mark/bench_linear_attention/bench_fused_gate_cumsum.py	87	# Step 2: chunk-local cumsum	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	38	# Step 1: Create kind cluster (idempotent)	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	50	# Step 2: Build Docker images (unless SKIP_DOCKER_BUILD=1)	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	75	# Step 3: Load images into kind	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	82	# Step 4: Apply namespace and RBAC	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	89	# Step 5: Deploy 3 fake-worker replicas behind a Service	COMMENT
LOW⚡	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	144	# Step 6: Deploy sgl-router. It is configured entirely via CLI flags in	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	336	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	342	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	348	# Step 3: Extract results (simulated tokenizer output)	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	367	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	373	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	379	# Step 3: Extract results (simulated tokenizer output for cross-encoder)	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	396	# Step 1: Detect format	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	402	# Step 2: Prepare input	COMMENT
LOW⚡	test/manual/test_tokenizer_manager.py	408	# Step 3: Extract results (simulated tokenizer output)	COMMENT
LOW	test/manual/dsv4/test_fused_compress_attn_hip.py	312	# Step 1: write current tokens to state (same for both paths)	COMMENT
LOW⚡	test/registered/unit/managers/test_hisparse_unit.py	457	# Step 1: load the first TOP_K positions from host (no newest token —	COMMENT
LOW⚡	test/registered/unit/managers/test_hisparse_unit.py	466	# Step 2: half overlap (hit) + half new (miss).	COMMENT
LOW⚡	test/registered/unit/parser/test_reasoning_parser.py	726	# Step 1: Send partial end tag when not in reasoning mode	COMMENT
LOW⚡	test/registered/unit/parser/test_reasoning_parser.py	732	# Step 2: Send normal text that doesn't complete the end tag	COMMENT
LOW	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	134	# Step 1: inc_lock_ref (pop_preallocated locks the matched node)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	150	# Step 2: cache_unfinished_req (dec old lock, inc new lock)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	153	# Step 3: cache_finished_req with is_insert=True (dec lock)	COMMENT
LOW	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	182	# Step 1: inc_lock_ref on root (simulates get_new_prebuilt_batch)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	199	# Step 2: cache_unfinished_req (dec root=no-op, inc new leaf)	COMMENT
LOW⚡	…egistered/unit/mem_cache/test_decode_radix_lock_ref.py	202	# Step 3: cache_finished_req (dec leaf)	COMMENT
LOW⚡	test/registered/unit/mem_cache/test_mamba_unittest.py	693	# Step 1: Insert [1,2,3] to create first node	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	705	# Step 2: Insert [1,2,3,4,5,6,7] with prev_prefix_len=0 (free all matched)	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	722	# Step 3: Insert [1,2,3,4,5,6,7,8] with prev_prefix_len=2	COMMENT
LOW	test/registered/unit/mem_cache/test_mamba_unittest.py	740	# Step 4: Insert [1,2,3,4,5,6,7,8,9] with prev_prefix_len=8 (covers all matched)	COMMENT
LOW⚡	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	793	# Step 1: insert 1 page	COMMENT
LOW⚡	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	797	# Step 2: insert 2 pages with prev_prefix_len=0 → frees overlap of 1 page	COMMENT
LOW	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	815	# Step 3: insert 3 pages with prev_prefix_len=len(seq_2p) → nothing freed	COMMENT
LOW⚡	…v_canary/test_self_e2e_perturb_real_kv_unused_cache.py	45	# Step 1: first batch builds radix entries that will become orphans once finished.	COMMENT
LOW⚡	…v_canary/test_self_e2e_perturb_real_kv_unused_cache.py	47	# Step 2: second batch drives more forward passes so the sweep cadence fires	COMMENT
LOW	test/registered/cpu/utils.py	72	# Step 2: compute per-block max abs values → scale	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	584	# Step 1: reasoning parser	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	588	# Step 2: feed normal_text into tool call parser (like serving_chat.py does)	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	620	# Step 1: reasoning parser	COMMENT
LOW⚡	test/registered/function_call/test_kimik2_detector.py	625	# Step 2: tool call parser on normal_text	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4368	# Step 2: baseline runs the known-good attention kernel.	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4371	# Step 3: graft output. baseline sends attn_out to target with a	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4383	# Step 1: graft input. target sends its real q to baseline along	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4392	# Step 2: target runs the (suspected buggy) attention kernel —	COMMENT
LOW⚡	test/registered/debug_utils/test_dumper.py	4396	# Step 3: graft output. baseline sends its (good) attn_out to	COMMENT
LOW	test/registered/debug_utils/test_dumper.py	4357	# Step 1: graft input. target sends its q to baseline; baseline's	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	2206	# Step 0: prefill with 2 sequences (3+2 tokens)	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	2215	# Step 1: decode (1 token per sequence)	COMMENT
LOW	…t/registered/debug_utils/comparator/test_entrypoint.py	2267	# Step 0: prefill — seq A (3 tokens) + seq B (2 tokens)	COMMENT
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4723	# Step 1: Create 4 target ranks where moe_tp is absent from ranks 2-3.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4726	# Step 2: _is_jointly_determined is called with parent_axes={tp, moe_tp}	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4728	# Step 3: edp remains undeclared → ValueError emitted as error record.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4825	# Step 1: 4 target ranks with TP=2, CP=2 (replicated), EDP=4.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4827	# Step 2: _is_jointly_determined is called with parent_axes={tp, cp}, child=edp.	STRING
LOW⚡	…t/registered/debug_utils/comparator/test_entrypoint.py	4829	# Step 3: CP replicated picks one rank per tp group → TP concat → correct shape.	STRING
216 more matches not shown…

Self-Referential Comments · 140 hits · 430 pts

Severity	File	Line	Snippet	Context
MEDIUM	benchmark/dspy/bench_dspy_intro.py	95	# Define the predictor.	COMMENT
MEDIUM	benchmark/dspy/bench_dspy_intro.py	107	# Define the predictor. Notice we're just changing the class. The signature BasicQA is unchanged.	COMMENT
MEDIUM	benchmark/mmmu/data_utils.py	157	# Create a JSON object with the filename as the key and caption as the value	COMMENT
MEDIUM	benchmark/hicache/bench_mix.py	24	# Create a lock for thread-safe debug log writing	COMMENT
MEDIUM	benchmark/prefill_only/bench_score.py	157	# Create the request builder function with shared tokenizer	COMMENT
MEDIUM⚡	benchmark/prefill_only/util.py	173	# Create the full request list by cycling through unique requests	COMMENT
MEDIUM	benchmark/multi_turn_chat/long_prompt_multi_turn.py	29	# Create a unique cache filename based on the arguments that affect generation	COMMENT
MEDIUM	experimental/sgl-router/tests/e2e/infra/gateway.py	43	# This file is at `experimental/sgl-router/tests/e2e/infra/gateway.py`,	COMMENT
MEDIUM	test/manual/test_weight_validation.py	75	# Create a minimal valid safetensors file	COMMENT
MEDIUM	test/manual/test_weight_validation.py	166	# Create a broken symlink for the index file	COMMENT
MEDIUM	test/manual/test_modelopt_fp8kvcache.py	15	# Create a ModelOptFp8Config object	COMMENT
MEDIUM	test/manual/test_config_integration.py	27	# Create a temporary config file	COMMENT
MEDIUM	test/manual/test_config_integration.py	82	# Create a temporary config file	COMMENT
MEDIUM	test/manual/test_config_integration.py	113	# Create a temporary config file	COMMENT
MEDIUM	test/manual/test_async_dynamic_batch_tokenizer.py	54	# Create a proper BatchEncoding-like object that supports dict operations	COMMENT
MEDIUM	test/manual/test_async_dynamic_batch_tokenizer.py	193	# Create a new async tokenizer with a failing tokenizer	COMMENT
MEDIUM	test/manual/attention/test_flashattn_backend.py	66	# Create a large enough req_to_token_pool to fit the test usage.	COMMENT
MEDIUM	test/manual/attention/test_flashattn_mla_backend.py	54	# Create a proper req_to_token_pool with the req_to_token attribute	COMMENT
MEDIUM⚡	…ual/lang_frontend/test_separate_reasoning_execution.py	72	# Create a mock backend to avoid AttributeError in __del__	COMMENT
MEDIUM⚡	…ual/lang_frontend/test_separate_reasoning_execution.py	75	# Create a StreamExecutor with necessary setup	COMMENT
MEDIUM	…ual/lang_frontend/test_separate_reasoning_execution.py	106	# Create a gen expression and a separate_reasoning expression	COMMENT
MEDIUM	test/manual/debug_utils/run_with_retry.py	37	# Create a TestFile with a reasonable estimated time	COMMENT
MEDIUM	test/manual/eval/test_longbench_v2_eval.py	73	# Create a temporary JSON file with sample data	COMMENT
MEDIUM	test/registered/quant/test_quant_config_parsing.py	23	# Create a raw instance	COMMENT
MEDIUM	test/registered/kernels/test_dsa_indexer.py	78	# Create a simple page table for testing	COMMENT
MEDIUM	test/registered/kernels/test_dsa_indexer.py	94	# Create a simple page table for testing with page size 1	COMMENT
MEDIUM	…st/registered/attention/test_wave_attention_kernels.py	186	# Define the varying parameter values	COMMENT
MEDIUM	…/registered/attention/test_triton_attention_kernels.py	314	# Define the varying parameter values	COMMENT
MEDIUM	…egistered/attention/test_normal_decode_set_metadata.py	129	# Create a simple SWA KV pool for testing	COMMENT
MEDIUM	…egistered/attention/test_normal_decode_set_metadata.py	150	# Create a minimal mock that inherits from SWAKVPool to pass isinstance check	COMMENT
MEDIUM	test/registered/unit/distributed/test_parallel_state.py	95	# Create a mock group object	COMMENT
MEDIUM	test/registered/unit/distributed/test_parallel_state.py	195	# Create a mock group object	COMMENT
MEDIUM	…stered/unit/function_call/test_function_call_parser.py	496	# Create a concrete implementation of BaseFormatDetector for testing	COMMENT
MEDIUM	…tered/unit/mem_cache/test_radix_cache_slru_accuracy.py	56	# Create a cache with the memory pools	COMMENT
MEDIUM	…ed/unit/mem_cache/test_unified_radix_cache_unittest.py	2031	# Create a chain: root -> A -> B -> C (3 levels)	COMMENT
MEDIUM	…t/registered/unit/model_loader/test_modelopt_loader.py	77	# Create a basic model config with unified quantization flag	COMMENT
MEDIUM	…egistered/unit/entrypoints/openai/test_serving_chat.py	732	# Create a mock conversation object that will be returned by generate_chat_conv	COMMENT
MEDIUM	…egistered/unit/entrypoints/openai/test_serving_chat.py	1380	# Create a mock conversation object	COMMENT
MEDIUM	test/registered/cpu/utils.py	465	# Create a new tensor with alternating values	COMMENT
MEDIUM	test/registered/cpu/utils.py	488	# Create a tensor with the E2M1 values	COMMENT
MEDIUM	test/registered/cpu/test_mamba.py	23	"""This function is intended to align with the l2norm implementation in the FLA library."""	STRING
MEDIUM	…registered/hicache/test_hicache_storage_3fs_backend.py	26	# Create a temporary JSON config file for HF3FS	COMMENT
MEDIUM	test/registered/lora/test_chunked_sgmv_backend.py	199	# Create a minimal mock ForwardBatch for the test	COMMENT
MEDIUM	test/registered/lora/test_chunked_sgmv_backend.py	231	# Create a minimal backend instance to access _get_segments_info	COMMENT
MEDIUM	test/registered/rl/test_update_weights_from_tensor.py	134	# Create a small set of parameters for testing	COMMENT
MEDIUM⚡	test/registered/rl/test_update_weights_from_tensor.py	148	# Create a flattened bucket	COMMENT
MEDIUM⚡	test/registered/rl/test_update_weights_from_tensor.py	155	# Create the dict format expected by _update_weights_from_flattened_bucket	COMMENT
MEDIUM⚡	test/registered/rl/test_update_weights_from_tensor.py	165	# Create a list where each rank contains the same serialized data	COMMENT
MEDIUM	python/sglang/benchmark/serving.py	1372	# Create a temporary DatasetRow object for warmup	COMMENT
MEDIUM	python/sglang/benchmark/serving.py	1388	# Create the test input once	COMMENT
MEDIUM	python/sglang/benchmark/serving.py	2521	# Create a mutually exclusive group for profiling URLs	COMMENT
MEDIUM	python/sglang/kernels/ops/attention/deepseek_v4_rope.py	12	# This module is imported during model-registry discovery. Keep it free of	COMMENT
MEDIUM	python/sglang/kernels/ops/layernorm/mhc.py	17	# This module is imported during model-registry discovery. Do not import the real	COMMENT
MEDIUM	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	115	"""This class implements batched matrix multiplication (C = A x SFA x B x SFB) with support for various data types	STRING
MEDIUM	python/sglang/kernels/ops/quantization/awq_triton.py	152	# Create the necessary shifts to use to unpack.	COMMENT
MEDIUM	python/sglang/test/nightly_bench_utils.py	72	# Create a combined link or use the first available one	COMMENT
MEDIUM	python/sglang/jit_kernel/ngram_embedding.py	135	# Create an empty tensor for ignore_tokens	COMMENT
MEDIUM	…/sglang/srt/layers/attention/flashattention_backend.py	2984	# Create a modified version for local attention that only processes the last token	COMMENT
MEDIUM	…/sglang/srt/layers/attention/flashattention_backend.py	3357	# Create a block_table for the local attention blocks	COMMENT
MEDIUM	…/sglang/srt/layers/attention/dsa/index_buf_accessor.py	636	# Define the token range within the block and the K dimension range handled by the thread.	COMMENT
80 more matches not shown…

Redundant / Tautological Comments · 259 hits · 395 pts

Severity	File	Line	Snippet	Context
LOW	…shinfer_allreduce_fusion/benchmark_fused_collective.py	1160	# Check if running with torchrun (required for collective operations)	STRING
LOW	benchmark/json_schema/bench_sglang.py	97	# Check if the outputs are valid	COMMENT
LOW	benchmark/ceval/bench_sglang.py	112	# Print results	COMMENT
LOW⚡	benchmark/prefill_only/util.py	163	# Check if we have any valid requests	COMMENT
LOW	benchmark/boolq/bench_sglang.py	89	# Print results	COMMENT
LOW	benchmark/gsm8k/bench_sglang.py	143	# Print results	COMMENT
LOW	benchmark/gsm8k/bench_other.py	129	# Print results	COMMENT
LOW	benchmark/mmlu/bench_sglang.py	179	# Print results	COMMENT
LOW	test/manual/test_schedule_policy.py	89	# Check if FCFS keeps the original order	COMMENT
LOW	test/manual/test_schedule_policy.py	114	# Check if priority enabled fcfs ordering is applied.	COMMENT
LOW	test/manual/test_schedule_policy.py	140	# Check if priority enabled fcfs ordering is applied.	COMMENT
LOW	test/manual/test_schedule_policy.py	162	# Check if priority enabled fcfs ordering is applied.	COMMENT
LOW	test/manual/test_schedule_policy.py	186	# Check if priority enabled fcfs ordering is applied.	COMMENT
LOW	test/manual/test_schedule_policy.py	212	# Check if priority enabled fcfs ordering is applied.	COMMENT
LOW	…nual/kv_transfer/test_mooncake_transfer_engine_init.py	201	# Check if mooncake should be used	COMMENT
LOW	…tered/unit/mem_cache/test_radix_cache_slru_accuracy.py	108	# Check if the frequently accessed key-value is still present	COMMENT
LOW	…tered/unit/mem_cache/test_radix_cache_slru_accuracy.py	114	# Check if the first low-frequency key-value has been evicted	COMMENT
LOW	…tered/unit/mem_cache/test_radix_cache_slru_accuracy.py	126	# Check if the tensor is empty, which indicates the key was not found (evicted)	COMMENT
LOW	…t/registered/unit/model_loader/test_modelopt_export.py	27	# Check if modelopt is available	COMMENT
LOW	…d/unit/batch_invariant_ops/test_batch_invariant_ops.py	53	# Check if results are identical	COMMENT
LOW	…d/unit/batch_invariant_ops/test_batch_invariant_ops.py	178	# Check if results are identical	COMMENT
LOW⚡	…registered/disaggregation/test_disaggregation_basic.py	214	# Check if servers are still healthy	COMMENT
LOW	…/ascend/interface/test_npu_openai_server_ignore_eos.py	84	# Check if ignore_eos resulted in more tokens or exactly max_tokens	COMMENT
LOW	…nd/basic_function/offloading/test_npu_offload_modes.py	70	# Check if server is running (basic functionality test)	COMMENT
LOW	…nai_server/validation/test_openai_server_ignore_eos.py	75	# Check if ignore_eos resulted in more tokens or exactly max_tokens	COMMENT
LOW	test/registered/observability/test_tracing.py	77	# Check if it's a RequestStageConfig (has stage_name and level attributes)	COMMENT
LOW⚡	…stered/amd/disaggregation/test_disaggregation_basic.py	321	# Check if servers are still healthy	COMMENT
LOW	…t/registered/amd/accuracy/mi30x/test_gsm8k_eval_amd.py	286	# Print results	COMMENT
LOW	…gistered/amd/accuracy/mi30x/test_vlms_mmmu_eval_amd.py	255	# Print results	COMMENT
LOW	…istered/jit/test_amd_deterministic_custom_allreduce.py	93	# Check if inputs fit in buffer	COMMENT
LOW	test/registered/jit/test_pos_enc.py	480	# Print results	COMMENT
LOW	…/registered/rl/test_update_weights_from_distributed.py	643	# Check if the weights of lm_head are tied with embed_tokens.	COMMENT
LOW	python/sglang/utils.py	420	# Check if the cache file already exists	COMMENT
LOW	python/sglang/benchmark/serving.py	1411	# Check if at least one warmup request succeeded	COMMENT
LOW	python/sglang/benchmark/one_batch.py	710	# Print output texts	COMMENT
LOW	python/sglang/benchmark/utils.py	102	# Check if the cache file already exists	COMMENT
LOW	python/sglang/benchmark/one_batch_server.py	802	# Print results	COMMENT
LOW	python/sglang/kernels/ops/attention/metadata.py	371	# Check if we should use the specialized fast path for page_size=1, no SWA	COMMENT
LOW	python/sglang/kernels/ops/attention/extend_attention.py	963	# Check if we can skip this tile	COMMENT
LOW	python/sglang/kernels/ops/attention/pad.py	295	# Check if we're in CUDA graph mode (buffers are pre-allocated)	COMMENT
LOW	python/sglang/kernels/ops/gemm/embedding_lora_a.py	56	# Check if this token is within the segment	COMMENT
LOW	python/sglang/kernels/ops/gemm/embedding_lora_a.py	70	# Check if this is an extra token	COMMENT
LOW	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	490	# Check if input data types are compatible with MMA instruction	COMMENT
LOW	python/sglang/kernels/ops/speculative/spec_tree.py	223	# Check if we've reached end of sibling list	COMMENT
LOW	python/sglang/test/few_shot_gsm8k_engine.py	135	# Print results	COMMENT
LOW	python/sglang/test/send_one.py	238	# Print results	COMMENT
LOW	python/sglang/test/runners.py	497	# Check if the text is empty or only whitespace.	COMMENT
LOW	python/sglang/test/runners.py	792	# Check if the text is empty or only whitespace.	COMMENT
LOW	python/sglang/test/few_shot_gsm8k.py	138	# Print results	COMMENT
LOW	python/sglang/test/ascend/e2e/run_npu_e2e_test.py	378	# Check if current line matches expected pattern	COMMENT
LOW	python/sglang/test/ascend/e2e/run_npu_e2e_test.py	465	# Check if pattern was successfully matched	COMMENT
LOW	python/sglang/test/ci/ci_utils.py	238	# Check if we should retry	COMMENT
LOW	…/sglang/test/server_fixtures/disaggregation_fixture.py	274	# Check if port is active	COMMENT
LOW	python/sglang/srt/dllm/mixin/scheduler.py	309	# Check if batch is full	COMMENT
LOW	python/sglang/srt/disaggregation/decode.py	1889	# Check if request was aborted due to corruption	COMMENT
LOW	python/sglang/srt/layers/sampler.py	541	# Check if any requests actually need logprobs computation	COMMENT
LOW	python/sglang/srt/layers/attention/xpu_backend.py	508	# Check if we should use local attention	COMMENT
LOW	python/sglang/srt/layers/attention/dsa_backend.py	902	# Check if MHA FP8 dequantization is needed	COMMENT
LOW	python/sglang/srt/layers/attention/dsa_backend.py	2799	# Check if sequence meets criteria for MHA_ONE_SHOT	COMMENT
LOW	…/sglang/srt/layers/attention/flashattention_backend.py	1149	# Check if we should use local attention	COMMENT
199 more matches not shown…

Cross-Language Confusion · 37 hits · 200 pts

Severity	File	Line	Snippet	Context
HIGH	benchmark/hicache/bench_mix.py	465	self.user_generator.push(	CODE
HIGH	experimental/sgl-router/tests/e2e/infra/gateway.py	329	"Build it first: `cd experimental/sgl-router && cargo build --release` "	CODE
HIGH	test/manual/models/test_qwen3_asr.py	630	f"first commit's previous_item_id must be JSON null, got {committed_1!r}",	CODE
HIGH	test/manual/debug_utils/test_log_parser.py	16	"""[{"line":"(SGLangEngine pid=35555) [2025-10-31 03:45:20 TP0] Decode batch [51341], #running-req: 317, #to	STRING
HIGH	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	2552	:param c_sf_ptr: Pointer to scale factor tensor for C (can be null)	STRING
HIGH	…nels/ops/quantization/nvfp4_gemm_swiglu_nvfp4_quant.py	2556	:param norm_const_ptr: Pointer to normalization constant for SFC generation (can be null)	STRING
HIGH	…tion_unittest/runner_modes/cuda_graph_decode_runner.py	412	# capture, which makes the capture-time output undefined; only	COMMENT
HIGH	python/sglang/srt/managers/scheduler.py	3870	f"dspark_force_budget_frac must be in (0, 1] or null, got {v}."	CODE
HIGH	python/sglang/srt/function_call/utils.py	297	return "string" # If only null, default to string	CODE
HIGH	python/sglang/srt/function_call/mimo_detector.py	144	<parameter=command>pwd && ls</parameter>	STRING
HIGH	python/sglang/srt/utils/field_validators.py	66	raise ValueError(f"must be list or null; got {type(v).__name__}")	CODE
HIGH	python/sglang/srt/utils/hf_transformers/tokenizer.py	367	# When the config omits the key or has null, use the v4 default so that	COMMENT
HIGH	python/sglang/srt/entrypoints/http_server.py	483	{"error": {"message": "...", "type": "...", "param": null, "code": <status>}}	STRING
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	16	queue.push("camera_actions", [["w"], ["d"]])	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	28	queue.push(	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	205	queue.push("camera_actions", [])	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	265	queue.push(ControlStateTransition(payload=["w"], seq_id=7))	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	266	queue.push(ControlStateTransition(payload=[], seq_id=8))	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	277	queue.push(ControlStateTransition(payload=["w"], seq_id=7))	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	287	queue.push(ControlStateTransition(payload=["w"], seq_id=7))	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	288	queue.push(ControlStateTransition(payload=["w", "d"], seq_id=8))	CODE
HIGH⚡	…en/test/unit/realtime/test_realtime_control_signals.py	289	queue.push(ControlStateTransition(payload=["d"], seq_id=9))	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	46	queue.push(	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	70	queue.push("camera_actions", [["w"], ["a"], ["s"]])	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	87	queue.push("camera_actions", [["w"]])	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	104	queue.push("camera_actions", [["w"]])	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	122	queue.push("camera_actions", [[]])	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	139	queue.push(	CODE
HIGH	…en/test/unit/realtime/test_realtime_control_signals.py	174	queue.push("camera_actions", [["w"], ["w"], ["w"], ["w"]])	CODE
HIGH	…ang/multimodal_gen/runtime/realtime/control_signals.py	190	self.push(	CODE
HIGH	…ang/multimodal_gen/runtime/realtime/control_signals.py	351	self._signals.push(self.kind, script, event_id=event_id)	CODE
HIGH	…ang/multimodal_gen/runtime/realtime/control_signals.py	411	self.push(transition)	CODE
HIGH	…ai/realtime/adapters/lingbot_world_realtime_adapter.py	57	self.prompt_queue.push("prompt", prompt, event_id=event_id)	CODE
HIGH	…rify/scripts/mechanical_refactor_reproduction_utils.py	48	exec_command(f"git add -A && git commit -m {shlex.quote(message)}", cwd=cwd)	CODE
HIGH	…scripts/tests/reproduction_utils/test_infra_helpers.py	69	"shell $HOME && rm -rf / ; metacharacters",	CODE
HIGH⚡	scripts/ci/test_list_stage_models.py	193	f.write('{"deny": null, "by_file": null}')	CODE
HIGH	scripts/ci/utils/runner_utilization_report.py	188	equal to `created_at` (not null). The previous code required both a	STRING

AI Slop Vocabulary · 72 hits · 181 pts

Severity	File	Line	Snippet	Context
MEDIUM	docker/k8s-sglang-distributed-sts.yaml	58	privileged: true # to leverage RDMA/InfiniBand device, co-work with HostNetwork=true	CODE
MEDIUM	…t/manual/layers/attention/dsa/test_act_quant_triton.py	269	# Run comprehensive benchmark	COMMENT
MEDIUM⚡	test/manual/layers/moe/bench_mxfp4_sm90_kernels.py	277	# Timing harness	COMMENT
MEDIUM	test/manual/4-gpu-models/test_qwen35_models_archived.py	17	# This eval harness applies the chat_template, which is critical for qwen3.5	COMMENT
MEDIUM	test/manual/4-gpu-models/test_qwen35_fp4_triton.py	5	# This eval harness applies the chat_template, which is critical for qwen3.5	COMMENT
MEDIUM	test/manual/eval/validate_longbench_v2_standalone.py	212	"""Generate comprehensive validation report."""	STRING
MEDIUM	…tered/attention/unittests/dense/test_flex_attention.py	65	# from logical positions, so it's robust to all non-tidy layouts.	COMMENT
MEDIUM	test/registered/attention/unittests/dense/test_fa3.py	48	# FlashAttention kernels are most stable in this harness with FA-friendly dims.	COMMENT
MEDIUM	test/registered/attention/unittests/dense/test_fa4.py	43	# FlashAttention kernels are most stable in this harness with FA-friendly dims.	COMMENT
MEDIUM⚡	…egistered/attention/unittests/swa/test_torch_native.py	105	# arithmetic, so it's robust to all non-tidy layouts.	COMMENT
MEDIUM⚡	test/registered/unit/test_runtime_context.py	299	# raise under the strict harness; override() stays the entry point.	COMMENT
MEDIUM	…/registered/chunked_prefill/test_scripted_core_1gpu.py	82	# output_ids are no longer observable through the harness. That case's	COMMENT
MEDIUM	test/registered/hicache/test_qwen35_hicache.py	15	# This eval harness applies the chat_template, which is critical for qwen3.5	COMMENT
MEDIUM	…t/registered/page_major/test_page_major_qwen_hybrid.py	33	# Measured in this harness: baseline (no page-major) and page-major both	COMMENT
LOW	test/registered/debug_utils/test_dumper.py	3345	# code can simply call `from sglang.srt.debug_utils.dumper import dumper`	COMMENT
MEDIUM	test/registered/jit/kv_canary/test_verify_fuzz.py	110	# harness exercises the chain-check-skip branch (added for SWA-evicted ancestor handling).	COMMENT
MEDIUM	test/registered/jit/minimax/test_minimax_decode_topk.py	38	"""Per-row sorted-desc multiset of the scores the kernel selected (tie-robust)."""	STRING
MEDIUM	test/registered/jit/minimax/test_minimax_decode_topk.py	111	# tie-robust: the multiset of selected scores must match torch.topk's	COMMENT
MEDIUM	…tion_unittest/runner_modes/speculative_draft_runner.py	181	# in EagleDraftWorker.__init__, which the harness skips), so build them	COMMENT
MEDIUM	…tion_unittest/runner_modes/speculative_draft_runner.py	199	# canary_manager); for the harness that's the fixture runner.	COMMENT
MEDIUM	…attention_unittest/attention_methods/dsv4_attention.py	1291	# In runner-harness flows the reference is called BEFORE	COMMENT
MEDIUM	python/sglang/jit_kernel/tests/utils.py	47	# single test exceeds half the harness budget, so a hung collective	COMMENT
MEDIUM	python/sglang/srt/server_args.py	6911	# under the strict test harness, a bare assignment outside	COMMENT
MEDIUM	python/sglang/srt/environ.py	203	# must go through ServerArgs.override() (enabled by the test harness).	COMMENT
LOW	python/sglang/srt/disaggregation/nixl/conn.py	183	), # hacky just add it into the message that will be sent	CODE
MEDIUM	python/sglang/srt/disaggregation/common/conn.py	1200	# Enable higher PP ranks to be bootstrapped earlier to make PP PD requests bootstrap more robust	COMMENT
LOW	…/sglang/srt/layers/attention/flashattention_backend.py	3291	# then we can simply use a cdiv for the rest.	COMMENT
LOW	…ang/srt/layers/attention/mamba/causal_conv1d_triton.py	154	# first chunk and does not have prior-token, so just set to 0	COMMENT
MEDIUM	…hon/sglang/srt/layers/attention/dsa/tilelang_kernel.py	1086	# to better utilize FP8 dynamic range, then apply the inverse scale after GEMM.	COMMENT
MEDIUM	python/sglang/srt/layers/quantization/fpgemm_fp8.py	47	# For GPUs that lack FP8 hardware suspport, we can leverage the Marlin	COMMENT
MEDIUM	python/sglang/srt/layers/quantization/fp8.py	311	# Keep both "model." and non-"model." variants for robust prefix matching.	COMMENT
MEDIUM	python/sglang/srt/layers/quantization/fp8.py	439	# For GPUs that lack FP8 hardware support, we can leverage the Marlin	COMMENT
MEDIUM	python/sglang/srt/layers/quantization/fp8_utils.py	754	# TODO: add more robust shape check here	COMMENT
MEDIUM	…hon/sglang/srt/layers/quantization/marlin_utils_fp8.py	69	# For GPUs that lack FP8 hardware support, we can leverage the	COMMENT
LOW	…srt/layers/quantization/mxfp4_flashinfer_trtllm_moe.py	374	# in-place. Otherwise `routed` is already scale-final and we just add	COMMENT
MEDIUM	python/sglang/srt/layers/moe/moe_runner/humming.py	218	# Neighboring nodes are required to utilize distinct workspaces.	COMMENT
LOW	python/sglang/srt/managers/io_struct.py	826	lora_id: Optional[str] = None # None means just use the base model	CODE
LOW	python/sglang/srt/managers/io_struct.py	1154	lora_id: Optional[str] = None # None means just use the base model	CODE
LOW	python/sglang/srt/managers/scheduler_pp_mixin.py	457	# otherwise, just pass along previous consensus	COMMENT
LOW	python/sglang/srt/managers/scheduler.py	4070	# In-place pause: just set the flag and return immediately.	COMMENT
LOW	python/sglang/srt/function_call/qwen3_coder_detector.py	41	# Base class already initializes _buffer, we just use it directly	COMMENT
LOW	python/sglang/srt/parser/harmony_parser.py	492	# If no emit, just return the held content	COMMENT
LOW	python/sglang/srt/mem_cache/hiradix_cache.py	1461	# unknown prefetch stop policy, just return True	COMMENT
MEDIUM	python/sglang/multimodal_gen/test/test_utils.py	1241	"""Probe whether a remote GT file exists, robust to transient failures."""	STRING
MEDIUM	…ultimodal_gen/test/unit/sana_wm/test_realtime_chain.py	186	# Tick 2: horizon exhausted -> seamless continuation, uniform chunk from	COMMENT
MEDIUM	…/sglang/multimodal_gen/test/server/testcase_configs.py	254	# Realtime video consistency harness. When set, server tests use	COMMENT
LOW	python/sglang/multimodal_gen/test/server/gpu_cases.py	70	# To test different models, simply add more DiffusionCase entries	COMMENT
MEDIUM	…_gen/test/single_test_file/component_accuracy/hooks.py	26	# These are harness defaults for synthetic accuracy inputs.	COMMENT
MEDIUM	…gen/test/single_test_file/component_accuracy/config.py	57	# of real divergence or unsupported reference loading in the harness.	COMMENT
MEDIUM	…lang/multimodal_gen/runtime/layers/quantization/fp8.py	191	# For GPUs that lack FP8 hardware support, we can leverage the Marlin	COMMENT
MEDIUM	…n/sglang/multimodal_gen/runtime/models/dits/sana_wm.py	849	# parity harness (env-gated, no-op in prod): on the FIRST sink-path call	COMMENT
MEDIUM	…imodal_gen/runtime/pipelines_core/stages/validators.py	503	"""Get a comprehensive summary of all validation failures."""	STRING
MEDIUM	…stages/model_specific_stages/sana_wm/realtime_chain.py	257	parity_probe.dump_tensor( # parity harness (no-op in prod)	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	17	# --- debug parity harness (gated by env; no-op in production) ---	COMMENT
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	288	if _dump_dir and state.chunk_idx == 0: # parity harness	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	333	if _dump_dir: # parity harness	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	379	if _dump_dir and chunk_idx == 0: # parity harness	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	419	parity_probe.dump_tensor( # parity harness	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	567	): # parity harness: run the OFFICIAL's exact stage-1 inputs	CODE
MEDIUM	…core/stages/model_specific_stages/sana_wm/streaming.py	576	) # parity harness: seeded pre-noise (cond @ frame 0)	CODE
12 more matches not shown…

Docstring Block Structure: 32 hits · 160 pts

Severity	File	Line	Snippet	Context
HIGH	python/sglang/kernels/ops/quantization/fp8_kernel.py	1902	Quantize input tensor to FP8 (8-bit floating point) format. Args: input (torch.Tensor): Input tensor to be quantiz	STRING
HIGH	python/sglang/test/ascend/test_ascend_utils.py	549	Start the service and obtain the inference results. Parameters: model: Model name num_prompts: Tota	STRING
HIGH	python/sglang/srt/server_args_config_parser.py	53	Merge configuration file arguments with command-line arguments. Configuration arguments are inserted a	STRING
HIGH	python/sglang/srt/server_args_config_parser.py	102	Parse YAML configuration file and convert to argument list. Args: file_path: Path to the Y	STRING
HIGH	python/sglang/srt/layers/model_parallel.py	32	Locally shards a full tensor based on indicated sharding arrangement, and returns a DTensor containing the loca	STRING
HIGH	…thon/sglang/srt/layers/attention/flashinfer_backend.py	517	Process multi-item scoring tensors for FlashInfer attention. This method handles sequences containing multiple	STRING
HIGH	python/sglang/srt/layers/attention/fla/chunk.py	146	Args: q (torch.Tensor): queries of shape `[B, T, H, K]` if `head_first=False` else `[B, H, T, K	STRING
HIGH	…hon/sglang/srt/layers/attention/fla/fused_recurrent.py	718	Args: q (torch.Tensor): queries of shape `[B, T, H, K]`. k (torch.Tensor):	STRING
HIGH	python/sglang/srt/layers/moe/cutlass_moe.py	50	Performs Fused MoE computation using CUTLASS-like kernels with FP8 weights and activations. This function implement	STRING
HIGH	python/sglang/srt/function_call/utils.py	223	Get consolidated $defs from all tools, validating for conflicts. Args: tools: List of tools to process	STRING
HIGH	python/sglang/srt/utils/network.py	130	Bind a TCP socket on the first available address family (IPv4/IPv6). Iterates over address families returned by _ge	STRING
HIGH	python/sglang/srt/utils/network.py	320	Automatically detect the local IP address using multiple fallback strategies. This function attempts to obtain	STRING
HIGH	python/sglang/srt/utils/nvtx_pytorch_hooks.py	46	Descends iterators that contains Tensors and prints the Tensor Recursive function that descends iterator type a	STRING
HIGH	python/sglang/srt/utils/nvtx_pytorch_hooks.py	74	Extract the static parameters from LLM and VLM relevant layer types Args: module_obj(class): Module	STRING
HIGH	python/sglang/srt/utils/nvtx_pytorch_hooks.py	198	Callback function that ends the NVTX marker Records the module name and tensor information Called after	STRING
HIGH	python/sglang/srt/utils/nvtx_pytorch_hooks.py	218	Creates an NVTX marker with the module name in it. This function is called before the module executes	STRING
HIGH	python/sglang/srt/utils/nvtx_pytorch_hooks.py	256	User level function that activates all the hooks The user needs to call this method from the network source cod	STRING
HIGH	python/sglang/srt/models/minicpmo.py	137	In streaming audio generation, determine which `text` positions the TTS model can attend to when generating each ch	STRING
HIGH	python/sglang/srt/models/minicpmo.py	616	Merge `input_ids` and `lm_spk_emb_last_hidden_states` to `inputs_embeds`. Args: input_ids (torch.Te	STRING
HIGH	python/sglang/srt/multimodal/processors/qwen_vl.py	133	calculate the number of frames for video used for model inputs. Args: ele (dict): a dict contains the confi	STRING
HIGH	python/sglang/srt/multimodal/processors/ernie45_vl.py	133	calculate the number of frames for video used for model inputs. Args: ele (dict): a dict contains the confi	STRING
HIGH	python/sglang/srt/mem_cache/storage/backend_factory.py	73	Create a storage backend instance. Args: backend_name: Name of the backend to create sto	STRING
HIGH	python/sglang/srt/model_loader/ci_weight_validation.py	1840	CI-specific download with validation and automatic retry on corruption. This function handles the download of	STRING
HIGH	python/sglang/srt/speculative/dflash_utils.py	259	Select target layer indices used to build DFlash context features. Args: num_target_layers: Number of trans	STRING
HIGH	python/sglang/srt/speculative/dflash_utils.py	552	Compute DFlash accept lengths and bonus tokens (greedy verify rule). Args: candidates: Token ids proposed b	STRING
HIGH	…/runtime/managers/memory_managers/layerwise_offload.py	424	Update consolidated CPU buffers with new weights. When layerwise offload (--dit-layerwise-offload) is enabled,	STRING
HIGH	…ang/multimodal_gen/runtime/utils/nvtx_pytorch_hooks.py	97	Walk ``model`` and attach forward pre/post hooks to every module. Args: model: Root module to instr	STRING
HIGH	…ng/multimodal_gen/runtime/models/encoders/qwen2_5vl.py	648	Calculate the 3D rope index based on image and video's temporal, height and width in LLM. Explanation:	STRING
HIGH	…imodal_gen/runtime/pipelines_core/stages/validators.py	345	Add a validation check for a field. Args: field_name: Name of the field being checked	STRING
HIGH	sgl-model-gateway/e2e_test/infra/run_eval.py	64	Run an evaluation and return metrics. Args: args: Configuration object with attributes: - base_	STRING
HIGH	sgl-model-gateway/e2e_test/infra/model_pool.py	727	Get a model instance by model_id, mode, and worker_type. If the model is not running, it will be launched on-de	STRING
HIGH	sgl-model-gateway/e2e_test/responses/test_basic_crud.py	24	Wait for background task to complete. Args: client: OpenAI client response_id: Response ID to poll	STRING

Modern AI Meta-Vocabulary: 29 hits · 92 pts

Severity	File	Line	Snippet	Context
MEDIUM	benchmark/dspy/bench_dspy_intro.py	113	# Print the input, the chain of thought, and the prediction.	COMMENT
MEDIUM⚡	benchmark/dspy/bench_dspy_intro.py	141	# Set up a basic teleprompter, which will compile our RAG program.	COMMENT
MEDIUM⚡	benchmark/dspy/bench_dspy_intro.py	147	# Ask any question you like to this simple RAG program.	COMMENT
MEDIUM	benchmark/line_retrieval/gen_data.py	89	# Append few-shot examples	COMMENT
MEDIUM	…t/manual/prefill_only/test_encoder_embedding_models.py	12	# Encoder embedding model tests (CUDA only)	COMMENT
MEDIUM	test/registered/pp/test_pp_single_node.py	184	# Gemma4 is instruction-tuned and doesn't follow few-shot completion	COMMENT
MEDIUM⚡	test/registered/jit/kv_canary/test_verify_hand.py	64	# Shared per-test scaffolding helpers.	COMMENT
MEDIUM	test/registered/eval/test_text_models_gsm8k_eval.py	29	# sgl-eval (zero-shot chat, \boxed{}, math_verify grading). Thresholds are	COMMENT
MEDIUM	python/pyproject.toml	153	# Pin sgl-eval to a git SHA: upgrading changes zero-shot \boxed{} grading, so	COMMENT
MEDIUM	python/sglang/benchmark/datasets/agentic_trace.py	13	# ~220-token average assistant reply of OpenHands-style agentic traces.	COMMENT
MEDIUM	python/sglang/test/run_eval.py	302	# Only the nightly correctness eval opts into sgl-eval (zero-shot	COMMENT
MEDIUM	python/sglang/test/simple_eval_mixed_prefix_gsm8k.py	64	# The evaluation data should not include the few-shot examples to prevent data leakage.	COMMENT
MEDIUM	python/sglang/jit_kernel/flash_attention.py	31	window_size=(-1, -1), # -1 means infinite context window	CODE
MEDIUM	python/sglang/jit_kernel/flash_attention_v3.py	124	window_size=(-1, -1), # -1 means infinite context window	CODE
MEDIUM	python/sglang/srt/managers/multi_tokenizer_mixin.py	446	# read SHM only. Drain it event-driven via the socket's fd instead of	COMMENT
MEDIUM	python/sglang/srt/managers/detokenizer_manager.py	204	# If it is embedding model, no detokenization is needed.	COMMENT
MEDIUM	python/sglang/srt/models/locate_anything.py	358	# <box>: a closed <box>...</box> in a few-shot / multi-turn prompt is	COMMENT
MEDIUM	python/sglang/srt/parser/conversation.py	87	# The number of few shot examples	COMMENT
MEDIUM	sgl-kernel/python/sgl_kernel/flash_attn.py	58	window_size=(-1, -1), # -1 means infinite context window	CODE
MEDIUM	.claude/skills/large-class-style/SKILL.md	29	### 1.3 Allowed: orchestration	COMMENT
MEDIUM	.claude/skills/large-class-style/SKILL.md	41	# model_runner.py — orchestration only.	COMMENT
MEDIUM	…-verify/scripts/mechanical_refactor_proof_generator.py	514	# are exactly the relocated symbols (the prep commit inlined them, scaffolding included, as	COMMENT
MEDIUM	…-verify/scripts/mechanical_refactor_proof_generator.py	680	# A move source the commit deletes (its defs all relocated, leaving only scaffolding) is	COMMENT
MEDIUM	docs_new/src/snippets/configs/poolside/laguna-m1.jsx	158	// agentic coding, repeated long contexts. "Enable" emits --enable-hierarchical-cache (+host L2);	COMMENT
MEDIUM	…les/frontend_language/usage/openai_chat_speculative.py	150	# expect answer with same format as in the few shot	COMMENT
MEDIUM⚡	examples/runtime/chain_of_verification.py	98	# `content` and the chain-of-thought in `reasoning_content`. In some	COMMENT
MEDIUM⚡	scripts/ci_monitor/ci_auto_bisect.py	1060	# Main orchestration	COMMENT
MEDIUM	scripts/ci/utils/ci_coverage_report.py	287	# adding scaffolding for an unused backend would widen every row with a	COMMENT
MEDIUM	…-model-gateway/e2e_test/embeddings/test_correctness.py	151	# Get model path from MODEL_SPECS for the embedding model	COMMENT

Hallucination Indicators: 8 hits · 80 pts

Severity	File	Line	Snippet	Context
CRITICAL	…tention_unittest/attention_methods/mamba2_attention.py	543	self.mixer.norm.weight.data = self.mixer.norm.weight.data.float()	CODE
CRITICAL	…ng/test/scripted_runtime/context/lock_ref_exhauster.py	43	stack = list(self.scheduler.tree_cache.root_node.children.values())	CODE
CRITICAL	python/sglang/test/scripted_runtime/context/radix.py	32	stack = list(ctx.scheduler.tree_cache.root_node.children.values())	CODE
CRITICAL	python/sglang/srt/layers/attention/fla/utils.py	231	return triton.runtime.driver.active.utils.get_device_properties(tensor_idx)[	CODE
CRITICAL	python/sglang/srt/layers/attention/fla/utils.py	286	triton.runtime.driver.active.utils.get_device_properties(i)[	CODE
CRITICAL	python/sglang/srt/models/glm4_moe.py	487	) and self.shared_experts.gate_up_proj.quant_method.quant_config.get_name() in {	CODE
CRITICAL	python/sglang/srt/models/deepseek_v2.py	761	) and self.shared_experts.gate_up_proj.quant_method.quant_config.get_name() in {	CODE
CRITICAL	…l_gen/runtime/pipelines_core/composed_pipeline_base.py	128	task_name = self.server_args.pipeline_config.task_type.name.lower()	CODE
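Every snippet listed above contains a chain of five consecutive attribute accesses (for example `self.scheduler.tree_cache.root_node.children.values`). The report does not state the scanner's actual rule, so the following is only a minimal sketch of how such chains could be located with Python's standard `ast` module, assuming that chain depth is what this category keys on. The function names `attribute_depth` and `deep_chains` and the `min_depth=5` threshold are hypothetical, chosen for illustration.

```python
import ast


def attribute_depth(node: ast.AST) -> int:
    """Count consecutive attribute accesses ending at node (a.b.c -> 2)."""
    depth = 0
    while isinstance(node, ast.Attribute):
        depth += 1
        node = node.value
    return depth


def deep_chains(source: str, min_depth: int = 5) -> list[tuple[int, str]]:
    """Return (line, expression) for each outermost chain of min_depth or more.

    Assumption: a "deep chain" is min_depth or more attribute accesses in a
    row. This is a guess at the heuristic, not the scanner's documented rule.
    """
    inner = set()  # ids of Attribute nodes nested inside a longer chain
    hits = []
    # ast.walk visits parents before children, so the outermost Attribute of
    # a chain is always seen before the shorter chains nested inside it.
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.Attribute):
            continue
        if isinstance(node.value, ast.Attribute):
            inner.add(id(node.value))
        if id(node) in inner:
            continue
        if attribute_depth(node) >= min_depth:
            hits.append((node.lineno, ast.unparse(node)))
    return hits
```

A chain that resolves at runtime is ordinary code, so a hit from a check like this is a prompt for manual review and does not by itself show that an attribute path is invented.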

Dead Code: 25 hits · 50 pts

Severity	File	Line	Context
MEDIUM	…ed/unit/managers/test_tokenizer_manager_rid_cleanup.py	509	CODE
MEDIUM	…egistered/unit/scripted_runtime/test_scheduler_hook.py	20	CODE
MEDIUM	…egistered/unit/scripted_runtime/test_scheduler_hook.py	25	CODE
MEDIUM	…/registered/unit/entrypoints/anthropic/test_serving.py	1087	CODE
MEDIUM	…egistered/unit/entrypoints/openai/test_serving_chat.py	1543	CODE
MEDIUM	python/sglang/srt/entrypoints/openai/serving_base.py	146	CODE
MEDIUM	python/sglang/srt/entrypoints/openai/serving_base.py	149	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	814	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	815	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	816	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	819	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	820	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	822	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	824	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	827	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	828	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	830	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	840	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	855	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	865	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	866	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	867	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	895	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	896	CODE
MEDIUM	…ang/multimodal_gen/runtime/models/dits/hunyuanvideo.py	898	CODE

Slop Phrases: 18 hits · 42 pts

Severity	File	Line	Snippet	Context
MEDIUM	…manual/openai_server/function_call/test_tool_choice.py	845	# # Start the local OpenAI Server. If necessary, you can add other parameters such as --enable-tools.	COMMENT
MEDIUM	…d/ascend/interface/test_npu_openai_function_calling.py	53	# Start the local OpenAI Server. If necessary, you can add other parameters such as --enable-tools.	COMMENT
MEDIUM	…i_server/function_call/test_openai_function_calling.py	43	# Start the local OpenAI Server. If necessary, you can add other parameters such as --enable-tools.	COMMENT
MEDIUM	…i_server/function_call/test_openai_function_calling.py	926	# # Start the local OpenAI Server. If necessary, you can add other parameters such as --enable-tools.	COMMENT
LOW	python/sglang/test/runners.py	488	# make sure to disable compile	COMMENT
MEDIUM	python/sglang/jit_kernel/flash_attention_v3.py	92	# That means if you use A100/A*0/L20/L40/L40s/4090 you can use fa3.	COMMENT
LOW	python/sglang/srt/models/hunyuan.py	776	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	python/sglang/srt/models/apertus.py	378	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	python/sglang/srt/models/mimo_v2.py	958	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	python/sglang/srt/models/llama.py	431	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	python/sglang/srt/models/glm4.py	393	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	python/sglang/srt/models/qwen2.py	405	# make sure to leave KV cache scale factors in a known good (dummy) state	COMMENT
LOW	…glang/multimodal_gen/runtime/models/vaes/hunyuanvae.py	273	# If you are encountering an error here, make sure to try running encoding/decoding with	COMMENT
MEDIUM	sgl-kernel/python/sgl_kernel/flash_attn.py	24	# That means if you use A100/A*0/L20/L40/L40s/4090 you can use fa3.	COMMENT
MEDIUM	sgl-kernel/tests/test_flash_attention.py	28	# That means if you use A100/A*0/L20/L40/L40s/4090 you can use fa3.	COMMENT
MEDIUM	…ntend_language/quick_start/local_example_llava_next.py	55	# Or you can use the 72B model	COMMENT
MEDIUM	…ntend_language/quick_start/local_example_llava_next.py	62	# Or you can use API models	COMMENT
MEDIUM	…-gateway/examples/wasm/wasm-guest-ratelimit/src/lib.rs	37	// This is a simplified example for demonstration purposes	COMMENT

Overly Generic Function Names: 12 hits · 15 pts

Severity	File	Line	Snippet	Context
LOW	benchmark/hicache/bench_mix.py	417	async def handle_request(self, user_data):	CODE
LOW	benchmark/hicache/bench_multiturn.py	388	async def handle_request(self, item):	CODE
LOW	python/sglang/srt/debug_utils/dumper.py	1370	def handle_request(self, *, method: str, body: dict[str, Any]) -> list[dict]:	CODE
LOW	python/sglang/srt/entrypoints/openai/serving_base.py	73	async def handle_request(	CODE
LOW	…ipts/tests/reproduction_utils/test_extract_function.py	180	signature=" def helper(self):",	CODE
LOW	…ipts/tests/reproduction_utils/test_extract_function.py	190	assert " def helper(self):\n val = compute_thing()\n" in out	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	84	"class Old:\n @staticmethod\n def helper(x):\n return x * 2\n"	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	91	assert "def helper(x):\n return x * 2\n" in (tmp_path / "dst.py").read_text()	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	292	" def helper(y):\n"	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	302	assert " @staticmethod\n def helper(y):\n" in dst_out	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	351	(tmp_path / "src.py").write_text("def helper(x):\n return x\n")	CODE
LOW⚡	…y/scripts/tests/reproduction_utils/test_move_symbol.py	357	assert " def helper(x):\n return x\n" in (tmp_path / "dst.py").read_text()	CODE

Synthetic Comment Markers: 2 hits · 15 pts

Severity	File	Line	Snippet	Context
HIGH	test/manual/attention/test_trtllm_mla_backend.py	1236	# Reshape as requested	COMMENT
HIGH	…ang/multimodal_gen/runtime/utils/hf_diffusers_utils.py	501	# Special handling for stop token <\|eom_id\|> generated by llama 3 tool use.	COMMENT

Fake / Example Data: 14 hits · 14 pts

Severity	File	Line	Snippet	Context
LOW	…l/sgl-router/tests/scripts/generate_parity_fixtures.py	41	"Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod "	CODE
LOW	…l/sgl-router/tests/scripts/generate_parity_fixtures.py	41	"Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod "	CODE
LOW	python/sglang/test/kits/ebnf_constrained_kit.py	62	self.__class__.ebnf_grammar = 'root ::= "user@example.com"'	CODE
LOW	python/sglang/srt/compilation/backend.py	233	if node.op in ("output", "placeholder"):	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	174	let claims = create_claims("admin@example.com", vec!["admin"]);	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	187	assert_eq!(validated.subject, "admin@example.com");	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	213	let claims = create_claims("user@example.com", vec!["user", "viewer"]);	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	221	assert_eq!(validated.subject, "user@example.com");	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	252	sub: "user@example.com".to_string(),	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	301	sub: "user@example.com".to_string(),	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	338	let claims = create_claims("user@example.com", vec!["admin"]);	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	590	let claims = create_claims("user@example.com", vec!["admin"]);	CODE
LOW	…-model-gateway/tests/security/auth_integration_test.rs	713	let claims = create_claims("user@example.com", vec!["admin"]);	CODE
LOW	sgl-model-gateway/e2e_test/benchmarks/conftest.py	38	"dummy-token",	CODE

TODO Padding: 5 hits · 8 pts

Severity	File	Line	Snippet	Context
LOW	test/manual/test_expert_distribution.py	20	# TODO: Add tests for DeepEP gatherer (currently our CI cannot run that)	COMMENT
LOW	python/sglang/srt/layers/torchao_utils.py	21	# TODO: implement a more general filter function	COMMENT
LOW	python/sglang/srt/layers/moe/fused_moe_triton/layer.py	1289	# TODO: consider using symmetric memory	COMMENT
LOW	sgl-kernel/csrc/cpu/common.h	369	// TODO: implement reverse order of [MB / cache_blocks_mb, NB, cache_blocks_mb]	COMMENT
LOW	sgl-kernel/python/sgl_kernel/top_k.py	12	# TODO: implement faster cuda kernels for large vocab sizes	COMMENT

Example Usage Blocks: 5 hits · 6 pts

Severity	File	Line	Snippet	Context
LOW	…rimental/sgl-router/tests/e2e/k8s_integration/setup.sh	6	# Usage:	COMMENT
LOW	scripts/killall_sglang.sh	8	# Usage:	COMMENT
LOW	scripts/ci/musa/rename_wheels_musa.sh	10	# Usage:	COMMENT
LOW	sgl-model-gateway/e2e_test/k8s_integration/setup.sh	8	# Usage:	COMMENT
LOW	…ngs/golang/examples/oai_server/scripts/analyze_tpot.sh	6	# Usage:	COMMENT
