NVIDIA/Megatron-LM

11.6

Adjusted Score

11.6

Raw Score

100%

Time Factor

2026-07-14

Last Push

17.1K

Stars

Python

Language

947.4K

Lines of Code

2.8K

Files

7.9K

Pattern Hits

2026-07-14

Scan Date

0.07

HC Hit Rate

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL 30HIGH 163MEDIUM 961LOW 6723

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.

Pattern Findings

The scanner identified 7877 distinct pattern matches across 25 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.

Hyper-Verbose Identifiers4353 hits · 4218 pts

Severity	File	Line	Snippet	Context
LOW	pretrain_vlm.py	227	def train_valid_test_datasets_provider(train_val_test_num_samples):	CODE
LOW	pretrain_vlm.py	267	def _preprocess_data_for_llava(data):	CODE
LOW	pretrain_vlm.py	465	def llava_position_embedding_ranks(pp_ranks):	CODE
LOW	train_rl.py	339	def train_valid_test_datasets_provider(train_val_test_num_samples):	CODE
LOW	pretrain_gpt.py	184	def _build_cached_logits_loss_func(	CODE
LOW	pretrain_gpt.py	372	def core_gpt_dataset_config_from_args(args: Any) -> GPTDatasetConfig:	CODE
LOW	pretrain_gpt.py	439	def train_valid_test_datasets_provider(train_val_test_num_samples, vp_stage=None):	CODE
LOW	model_provider.py	62	def count_parameters_in_layer(model, layer_name):	CODE
LOW	pretrain_hybrid.py	184	def _build_cached_logits_loss_func(	CODE
LOW	pretrain_hybrid.py	359	def core_gpt_dataset_config_from_args(args: Any) -> GPTDatasetConfig:	CODE
LOW	pretrain_hybrid.py	403	def train_valid_test_datasets_provider(train_val_test_num_samples, vp_stage=None):	CODE
LOW	gpt_builders.py	108	def _get_transformer_layer_spec(use_te, config):	CODE
LOW	tasks/finetune_utils.py	53	def _cross_entropy_forward_step(batch, model):	CODE
LOW	tasks/finetune_utils.py	95	def _build_infinite_size_dataloader(dataloader):	CODE
LOW	tasks/finetune_utils.py	106	def _build_train_valid_dataloaders(train_dataset, valid_dataset,	CODE
LOW	tasks/data_utils.py	35	def build_tokens_types_paddings_from_text(text_a, text_b,	CODE
LOW	tasks/data_utils.py	49	def build_tokens_types_paddings_from_ids(text_a_ids, text_b_ids, max_seq_length,	CODE
LOW	tasks/eval_utils.py	65	def calculate_correct_answers(name, model, dataloader,	CODE
LOW	tasks/eval_utils.py	111	def correct_answers_forward_step(batch, model):	CODE
LOW	tools/run_vlm_text_generation.py	161	def generate_and_write_samples(model):	CODE
LOW	tools/prepare_cache.py	54	def _normalize_prepare_cache_args(args: Any) -> None:	CODE
LOW	tools/prepare_cache.py	65	def _validate_prepare_cache_args(args: Any) -> None:	CODE
LOW	tools/prepare_cache.py	82	def _disable_cache_load_only_flags(args: Any) -> Dict[str, bool]:	CODE
LOW	tools/prepare_cache.py	102	def _print_effective_configuration(	CODE
LOW	tools/prepare_cache.py	120	def core_gpt_dataset_config_from_args(args: Any) -> GPTDatasetConfig:	CODE
LOW	tools/run_inference_performance_test.py	49	def add_inference_benchmarking_args(parser):	CODE
LOW	tools/build_sequences_per_dataset.py	60	def build_sequences_per_dataset(args):	CODE
LOW	tools/run_dynamic_text_generation_server.py	21	def add_text_generation_server_args(parser: argparse.ArgumentParser):	CODE
LOW	tools/run_dynamic_text_generation_server.py	37	async def run_text_generation_server(	CODE
LOW	tools/checkpoint/saver_hf_llava.py	176	def receive_vision_projection(self):	CODE
LOW	tools/checkpoint/saver_hf_llava.py	352	def save_state_dict_to_hf_checkpoint(self):	CODE
LOW	tools/checkpoint/schema_core.py	10	def get_core_transformer_block_key(model_key):	CODE
LOW	tools/checkpoint/schema_hf.py	173	def get_language_model_schema(	CODE
LOW	tools/checkpoint/loader_base.py	30	def _maybe_parse_additional_megatron_args(self, margs, checkpoint_args):	CODE
LOW	tools/checkpoint/loader_base.py	90	def _maybe_ensure_additional_required_arguments(self):	CODE
LOW	tools/checkpoint/loader_base.py	107	def ensure_required_arguments(self):	CODE
LOW	tools/checkpoint/loader_base.py	193	def get_models_for_pipeline_stage(count, dtype):	CODE
LOW	tools/checkpoint/loader_base.py	425	def build_checkpoint_metadata(self, true_vocab_size):	CODE
LOW⚡	tools/checkpoint/gpt_hybrid_conversion.py	404	def initialize_ssm_layer_params(	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	152	def parse_hybrid_layer_pattern(pattern):	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	183	def build_layer_index_mapping(layer_types, direction):	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	316	def validate_pattern_gpt_compatible(layer_types, direction):	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	361	def validate_source_args_gpt_compatible(source_args, direction):	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	69	def resolve_checkpoint_subdir(load_dir):	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	113	def ensure_single_rank_process_group():	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	159	def load_dist_checkpoint_full(load_dir):	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	221	def save_dist_checkpoint_full(	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	260	def write_latest_iteration_marker(save_dir, iteration):	CODE
LOW	tools/checkpoint/loader_mixtral_hf.py	28	def load_args_from_checkpoint(args):	CODE
LOW⚡	tools/checkpoint/loader_mixtral_hf.py	63	def verify_transformers_version():	CODE
LOW	tools/checkpoint/saver_llava.py	134	def _maybe_parse_additional_megatron_args(self, margs):	CODE
LOW	tools/checkpoint/saver_llava.py	320	def receive_vision_projection(self):	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	288	def save_checkpoint_with_pickle_protocol(state_dict, output_dir, pickle_protocol=4):	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	292	def transform_object_override(write_item, obj):	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	831	def convert_torch_dist_to_fsdp_dtensor(	CODE
LOW	tools/checkpoint/saver_base.py	31	def _maybe_parse_additional_megatron_args(self, margs):	CODE
LOW	tools/checkpoint/saver_base.py	38	def insert_megatron_path_and_check_te(self):	CODE
LOW	tools/checkpoint/saver_base.py	277	def receive_checkpoint_metadata(self):	CODE
LOW	tools/checkpoint/saver_base.py	341	def save_local_models_to_checkpoint(self):	CODE
LOW	tools/checkpoint/loader_llava.py	54	def _maybe_parse_additional_megatron_args(self, margs, checkpoint_args):	CODE
4293 more matches not shown…

Decorative Section Separators755 hits · 2630 pts

Severity	File	Line	Snippet	Context
MEDIUM⚡	tools/checkpoint/gpt_hybrid_conversion.py	400	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tools/checkpoint/gpt_hybrid_conversion.py	402	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	132	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	134	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	225	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	227	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	486	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	488	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	530	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	532	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	627	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	629	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	698	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	700	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	722	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tools/checkpoint/gpt_hybrid_conversion.py	724	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tools/checkpoint/saver_llava.py	225	#-----------	COMMENT
MEDIUM	tools/checkpoint/saver_llava.py	186	#-----------	COMMENT
MEDIUM	tools/checkpoint/saver_base.py	377	#-----------	COMMENT
MEDIUM	tools/checkpoint/saver_base.py	420	# --------------	COMMENT
MEDIUM	tools/checkpoint/saver_base.py	432	# ------------------	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	360	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	362	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	386	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	388	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	410	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	412	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	433	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	435	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	474	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	476	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	496	# ================================================================	COMMENT
MEDIUM⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	498	# ================================================================	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	24	# ── Parse KEY=VALUE positional args ───────────────────────────────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	75	# ── Read model_config.yaml ────────────────────────────────────────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	108	# ── Build model args (substituting ${CHECKPOINT_LOAD_PATH}) ───────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	131	# ── Make image-bundled extras (mamba-ssm) visible to the cog venv ─────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	171	# ── Launch the inference server in the background ─────────────────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	228	# ── Wait for server readiness ─────────────────────────────────────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	248	# ── Benchmark sweep ───────────────────────────────────────────────────────────	COMMENT
MEDIUM	…ts/performance_tests/shell_test_utils/run_perf_test.sh	267	# ── Baseline comparison or recording ──────────────────────────────────────────	COMMENT
MEDIUM⚡	tests/unit_tests/test_hyper_comm_grid.py	313	# ------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/unit_tests/test_hyper_comm_grid.py	315	# ------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/unit_tests/test_hyper_comm_grid.py	439	# ------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/unit_tests/test_hyper_comm_grid.py	441	# ------------------------------------------------------------------	COMMENT
MEDIUM	tests/unit_tests/test_optimizer_param_scheduler.py	302	# ── get_canonical_lr_for_logging tests ──────────────────────────────────────	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	64	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	66	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	858	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	860	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	1305	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	1307	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	1592	# ===========================================================================	COMMENT
MEDIUM⚡	tests/unit_tests/test_emerging_optimizers.py	1594	# ===========================================================================	COMMENT
MEDIUM	tests/unit_tests/test_argument_utils.py	681	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/unit_tests/test_argument_utils.py	683	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/unit_tests/tokenizers/test_tokenizer.py	641	# ---------------------------------------------------------------------------	COMMENT
MEDIUM⚡	tests/unit_tests/tokenizers/test_tokenizer.py	643	# ---------------------------------------------------------------------------	COMMENT
MEDIUM	tests/unit_tests/tokenizers/test_tokenizer.py	473	# ------------------------------------------------------------------------	COMMENT
MEDIUM	tests/unit_tests/tokenizers/test_tokenizer.py	475	# ------------------------------------------------------------------------	COMMENT
695 more matches not shown…

Unused Imports860 hits · 810 pts

Severity	File	Line	Context
LOW	train_rl.py	4	CODE
LOW	mamba_builders.py	14	CODE
LOW	mamba_builders.py	15	CODE
LOW	pretrain_gpt.py	26	CODE
LOW	pretrain_gpt.py	64	CODE
LOW	pretrain_hybrid.py	25	CODE
LOW	pretrain_hybrid.py	64	CODE
LOW	gpt_builders.py	20	CODE
LOW	tools/merge_datasets.py	3	CODE
LOW	tools/preprocess_data_nmt.py	6	CODE
LOW	tools/run_inference_performance_test.py	37	CODE
LOW	tools/run_inference_performance_test.py	41	CODE
LOW	tools/run_inference_performance_test.py	43	CODE
LOW	tools/run_text_generation_server.py	6	CODE
LOW	tools/linter.py	11	CODE
LOW	tools/preprocess_mmdata.py	12	CODE
LOW	tools/checkpoint/saver_core.py	3	CODE
LOW	tools/checkpoint/saver_core.py	5	CODE
LOW	tools/checkpoint/saver_core.py	7	CODE
LOW	tools/checkpoint/saver_core.py	8	CODE
LOW	tools/checkpoint/remap_gpt_dsa_to_mamba.py	38	CODE
LOW	tools/checkpoint/remap_gpt_dsa_to_mamba.py	39	CODE
LOW	tools/checkpoint/remap_gpt_dsa_to_mamba.py	108	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	122	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	20	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	31	CODE
LOW	tools/checkpoint/utils.py	3	CODE
LOW	tools/checkpoint/hybrid_conversion.py	10	CODE
LOW	tools/checkpoint/loader_mixtral_hf.py	3	CODE
LOW	tools/checkpoint/loader_mixtral_hf.py	167	CODE
LOW	tools/checkpoint/saver_llava.py	2	CODE
LOW	tools/checkpoint/saver_llava.py	4	CODE
LOW	tools/checkpoint/saver_llava.py	173	CODE
LOW	tools/checkpoint/saver_llava.py	173	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	37	CODE
LOW	tools/checkpoint/loader_core.py	3	CODE
LOW	tools/checkpoint/loader_core.py	4	CODE
LOW	tools/checkpoint/loader_core.py	5	CODE
LOW	tools/checkpoint/loader_core.py	6	CODE
LOW	tools/checkpoint/loader_core.py	7	CODE
LOW	tools/checkpoint/saver_base.py	2	CODE
LOW	tools/checkpoint/loader_llava.py	3	CODE
LOW	tools/checkpoint/loader_llava.py	7	CODE
LOW	tools/bert_embedding/embed.py	3	CODE
LOW	tools/bert_embedding/embed.py	8	CODE
LOW	tools/bert_embedding/embed.py	15	CODE
LOW	tools/bert_embedding/embed.py	16	CODE
LOW	tools/bert_embedding/embed.py	19	CODE
LOW	tools/bert_embedding/embed.py	21	CODE
LOW	tools/bert_embedding/embed.py	21	CODE
LOW	tools/bert_embedding/__init__.py	3	CODE
LOW	tools/bert_embedding/__init__.py	3	CODE
LOW	…formance_tests/shell_test_utils/compare_to_baseline.py	23	CODE
LOW	tests/unit_tests/test_fp8_param.py	3	CODE
LOW	tests/unit_tests/test_fp8_param.py	29	CODE
LOW	tests/unit_tests/test_fp8_param.py	39	CODE
LOW	tests/unit_tests/test_inference.py	3	CODE
LOW	tests/unit_tests/test_basic.py	2	CODE
LOW	tests/unit_tests/conftest.py	4	CODE
LOW	tests/unit_tests/test_emerging_optimizers.py	11	CODE
800 more matches not shown…

Cross-File Repetition111 hits · 555 pts

Severity	File	Line	Snippet	Context
HIGH	train_rl.py	0	forward training step. args: data_iterator : input data iterator model (hybridmodel): the gpt model	STRING
HIGH	pretrain_hybrid.py	0	forward training step. args: data_iterator : input data iterator model (hybridmodel): the gpt model	STRING
HIGH	megatron/elastification/pretrain_hybrid_flex.py	0	forward training step. args: data_iterator : input data iterator model (hybridmodel): the gpt model	STRING
HIGH	pretrain_gpt.py	0	loss function. args: loss_mask (torch.tensor): used to mask out some portions of the loss output_tensor (torch.tensor):	STRING
HIGH	pretrain_hybrid.py	0	loss function. args: loss_mask (torch.tensor): used to mask out some portions of the loss output_tensor (torch.tensor):	STRING
HIGH	megatron/elastification/pretrain_hybrid_flex.py	0	loss function. args: loss_mask (torch.tensor): used to mask out some portions of the loss output_tensor (torch.tensor):	STRING
HIGH	pretrain_gpt.py	0	build the train test and validation datasets. args: train_val_test_num_samples : a list containing the number of samples	STRING
HIGH	pretrain_hybrid.py	0	build the train test and validation datasets. args: train_val_test_num_samples : a list containing the number of samples	STRING
HIGH	examples/post_training/modelopt/finetune.py	0	build the train test and validation datasets. args: train_val_test_num_samples : a list containing the number of samples	STRING
HIGH	examples/t5/pretrain_t5.py	0	build the train test and validation datasets. args: train_val_test_num_samples : a list containing the number of samples	STRING
HIGH	megatron/elastification/pretrain_hybrid_flex.py	0	build the train test and validation datasets. args: train_val_test_num_samples : a list containing the number of samples	STRING
HIGH	tools/run_text_generation_server.py	0	utility to get the relevant backend for running inference this function will automatically chose the trtllmbackend when	STRING
HIGH	examples/inference/advanced/gpt_static_inference.py	0	utility to get the relevant backend for running inference this function will automatically chose the trtllmbackend when	STRING
HIGH	…amples/inference/advanced/simple_t5_batch_inference.py	0	utility to get the relevant backend for running inference this function will automatically chose the trtllmbackend when	STRING
HIGH	tools/checkpoint/saver_hf_llava.py	0	required top-level function that creates the saver and calls its .save().	STRING
HIGH	tools/checkpoint/saver_core.py	0	required top-level function that creates the saver and calls its .save().	STRING
HIGH	tools/checkpoint/saver_llava.py	0	required top-level function that creates the saver and calls its .save().	STRING
HIGH	tools/checkpoint/loader_base.py	0	orchestrates loading a megatron checkpoint and sending model parameters over a given multiprocessing queue. args: args:	STRING
HIGH	tools/checkpoint/loader_core.py	0	orchestrates loading a megatron checkpoint and sending model parameters over a given multiprocessing queue. args: args:	STRING
HIGH	tools/checkpoint/loader_llava.py	0	orchestrates loading a megatron checkpoint and sending model parameters over a given multiprocessing queue. args: args:	STRING
HIGH	tools/checkpoint/loader_base.py	0	parse megatron arguments by forcibly overwriting sys.argv. populates self.margs and self.checkpoint_args.	STRING
HIGH	tools/checkpoint/saver_base.py	0	parse megatron arguments by forcibly overwriting sys.argv. populates self.margs and self.checkpoint_args.	STRING
HIGH	tools/checkpoint/loader_llava.py	0	parse megatron arguments by forcibly overwriting sys.argv. populates self.margs and self.checkpoint_args.	STRING
HIGH	tools/checkpoint/loader_base.py	0	construct a sys.argv list for megatron's argument parser. this centralizes the hack of overwriting sys.argv.	STRING
HIGH	tools/checkpoint/loader_core.py	0	construct a sys.argv list for megatron's argument parser. this centralizes the hack of overwriting sys.argv.	STRING
HIGH	tools/checkpoint/saver_base.py	0	construct a sys.argv list for megatron's argument parser. this centralizes the hack of overwriting sys.argv.	STRING
HIGH	tools/checkpoint/loader_llava.py	0	construct a sys.argv list for megatron's argument parser. this centralizes the hack of overwriting sys.argv.	STRING
HIGH	tests/unit_tests/test_hyper_comm_grid.py	0	set up distributed environment for the entire test class.	STRING
HIGH	…it_tests/pipeline_parallel/test_bridge_communicator.py	0	set up distributed environment for the entire test class.	STRING
HIGH	…sts/pipeline_parallel/test_multimodule_communicator.py	0	set up distributed environment for the entire test class.	STRING
HIGH	…it_tests/pipeline_parallel/test_bridge_communicator.py	0	destroy all tracked grids and bridge communicator pgs.	STRING
HIGH	…_tests/pipeline_parallel/test_multimodule_schedules.py	0	destroy all tracked grids and bridge communicator pgs.	STRING
HIGH	tests/unit_tests/models/mimo/test_mimo_1f1b_schedule.py	0	destroy all tracked grids and bridge communicator pgs.	STRING
HIGH	tests/unit_tests/models/mimo/test_mimo_1f1b_schedule.py	0	return a transformerconfig for the vision projection mlp.	STRING
HIGH	examples/mimo/configs/llava_vlm.py	0	return a transformerconfig for the vision projection mlp.	STRING
HIGH	examples/mimo/configs/llava_avlm.py	0	return a transformerconfig for the vision projection mlp.	STRING
HIGH	…unit_tests/dist_checkpointing/models/test_gpt_model.py	0	test model loading with different vocab size (caused by tp padding).	STRING
HIGH	…nit_tests/dist_checkpointing/models/test_bert_model.py	0	test model loading with different vocab size (caused by tp padding).	STRING
HIGH	tests/unit_tests/dist_checkpointing/models/common.py	0	test model loading with different vocab size (caused by tp padding).	STRING
HIGH	…ultimodal/evaluation/evaluate_video_phys_game_bench.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_realworldqa.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_ai2d.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	…es/multimodal/evaluation/evaluate_video_motionbench.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_spdocvqa.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_textvqa.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_chartqa.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	…amples/multimodal/evaluation/evaluate_video_mvbench.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_mathvista.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_vqav2.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_ocrbench.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	examples/multimodal/evaluation/evaluate_infovqa.py	0	merge input files to a format compatible with the evaluator.	STRING
HIGH	megatron/core/utils.py	0	check if minimum version of `transformer-engine` is installed.	STRING
HIGH	…/distributed/fsdp/src/megatron_fsdp/mixed_precision.py	0	check if minimum version of `transformer-engine` is installed.	STRING
HIGH	…atron/core/distributed/fsdp/src/megatron_fsdp/utils.py	0	check if minimum version of `transformer-engine` is installed.	STRING
HIGH	megatron/core/ssm/mamba_mixer.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
HIGH	megatron/core/transformer/attention.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
HIGH	megatron/core/transformer/transformer_layer.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
HIGH	megatron/core/transformer/mlp.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
HIGH	megatron/core/transformer/multi_token_prediction.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
HIGH	megatron/core/transformer/moe/moe_layer.py	0	args: name (str \| none): module instance name passed top-down from its paranet module	STRING
51 more matches not shown…

Deep Nesting571 hits · 482 pts

Severity	File	Line	Context
LOW	gpt_builders.py	25	CODE
LOW	tasks/finetune_utils.py	147	CODE
LOW	tasks/eval_utils.py	65	CODE
LOW	.gitlab/scripts/check_imports.py	66	CODE
LOW	.gitlab/scripts/check_imports.py	123	CODE
LOW	tools/preprocess_data.py	310	CODE
LOW	tools/run_vlm_text_generation.py	88	CODE
LOW	tools/checkpoint/saver_hf_llava.py	69	CODE
LOW	tools/checkpoint/loader_base.py	254	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	631	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	726	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	131	CODE
LOW	tools/checkpoint/hybrid_conversion.py	59	CODE
LOW	tools/checkpoint/hybrid_conversion.py	114	CODE
LOW	tools/checkpoint/hybrid_conversion.py	170	CODE
LOW	tools/checkpoint/hybrid_conversion.py	222	CODE
LOW	tools/checkpoint/saver_llava.py	183	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	74	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	335	CODE
LOW	tools/checkpoint/checkpoint_inspector.py	951	CODE
LOW	tools/checkpoint/saver_base.py	365	CODE
LOW	tools/checkpoint/loader_llava.py	161	CODE
LOW	…s/common_pile_dataset/create_common_pile_ci_dataset.py	63	CODE
LOW	…s/common_pile_dataset/create_common_pile_ci_dataset.py	109	CODE
LOW	tools/moe_routing/analyze_routing_predictability.py	54	CODE
LOW	tools/moe_routing/analyze_routing_concentration.py	70	CODE
LOW	…formance_tests/shell_test_utils/compare_to_baseline.py	36	CODE
LOW	…shell_test_utils/determinism/print_nsys_leaderboard.py	85	CODE
LOW	tests/unit_tests/test_fp8_param.py	215	CODE
LOW	tests/unit_tests/test_optimizer.py	844	CODE
LOW	tests/unit_tests/test_utils.py	366	CODE
LOW	tests/unit_tests/test_imports.py	63	CODE
LOW	tests/unit_tests/test_layer_wise_optimizer.py	254	CODE
LOW	tests/unit_tests/test_layer_wise_optimizer.py	724	CODE
LOW	tests/unit_tests/ssm/ops/test_ssd_bmm.py	68	CODE
LOW	tests/unit_tests/tokenizers/test_tokenizer.py	291	CODE
LOW	…t_tests/tools/checkpoint/test_gpt_hybrid_conversion.py	461	CODE
LOW	tests/unit_tests/extension/test_kitchen_sdpa.py	60	CODE
LOW	tests/unit_tests/determinism/utils.py	119	CODE
LOW	…ted/mfsdp_v1/test_mcore_fully_sharded_data_parallel.py	1050	CODE
LOW	…ted/mfsdp_v1/test_mcore_fully_sharded_data_parallel.py	1125	CODE
LOW	…ted/mfsdp_v1/test_mcore_fully_sharded_data_parallel.py	1128	CODE
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	177	CODE
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	914	CODE
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	1084	CODE
LOW	tests/unit_tests/transformer/test_cuda_graphs.py	506	CODE
LOW	tests/unit_tests/transformer/test_cuda_graphs.py	698	CODE
LOW	tests/unit_tests/transformer/test_cuda_graphs.py	1368	CODE
LOW	tests/unit_tests/transformer/test_transformer_block.py	405	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	2057	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3185	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3219	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3305	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3433	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3470	CODE
LOW	…mental_attention_variant/test_attention_variant_dsa.py	3708	CODE
LOW	…erimental_attention_variant/dsa_native_parity_utils.py	416	CODE
LOW	tests/unit_tests/training/config/test_container_base.py	248	CODE
LOW	…it_tests/pipeline_parallel/test_bridge_communicator.py	411	CODE
LOW	…sts/pipeline_parallel/test_multimodule_communicator.py	170	CODE
511 more matches not shown…

Hallucination Indicators30 hits · 415 pts

Severity	File	Line	Snippet	Context
CRITICAL⚡	tools/checkpoint/loader_mixtral_hf.py	69	model.embedding.word_embeddings.weight.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/loader_mixtral_hf.py	74	model.decoder.final_layernorm.weight.data.copy_(hf_model.model.norm.weight)	CODE
CRITICAL	tools/checkpoint/loader_mixtral_hf.py	103	layer.mlp.router.weight.data.copy_(hf_layer.block_sparse_moe.gate.weight)	CODE
CRITICAL	tools/checkpoint/loader_mixtral_hf.py	127	layer.self_attention.linear_qkv.layer_norm_weight.data.copy_(hf_layer.input_layernorm.weight)	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	199	model.vision_model.conv1.weight.data.copy_(vit_embeddings_msg["conv1 weight"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	201	model.vision_model.conv1.bias.data.copy_(vit_embeddings_msg["conv1 bias"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	202	model.vision_model.position_embeddings.weight.data.copy_(vit_embeddings_msg["position embeddings"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	208	model.vision_model.embedder.weight.data.copy_(embedder_weight[tp_rank])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	210	model.vision_model.embedder.bias.data.copy_(embedder_bias[tp_rank])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	214	model.vision_model.ln_pre.weight.data.copy_(vit_embeddings_msg["ln pre weight"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	215	model.vision_model.ln_pre.bias.data.copy_(vit_embeddings_msg["ln pre bias"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	218	model.vision_model.ln_post.weight.data.copy_(vit_embeddings_msg["ln post weight"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	219	model.vision_model.ln_post.bias.data.copy_(vit_embeddings_msg["ln post bias"])	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	343	model.vision_projection.encoder.linear_fc1.weight.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	345	model.vision_projection.encoder.linear_fc2.weight.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	348	model.vision_projection.encoder.linear_fc1.layer_norm_weight.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	351	model.vision_projection.encoder.linear_fc1.layer_norm_bias.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	354	model.vision_projection.encoder.linear_fc1.bias.data.copy_(	CODE
CRITICAL⚡	tools/checkpoint/saver_llava.py	356	model.vision_projection.encoder.linear_fc2.bias.data.copy_(vision_projection_l1_bias)	CODE
CRITICAL⚡	tests/unit_tests/transformer/moe/test_routers.py	115	assert self.sequential_mlp.router.weight.grad.abs().sum() == 0	CODE
CRITICAL⚡	tests/unit_tests/transformer/moe/test_routers.py	121	assert self.sequential_mlp.router.weight.grad.abs().sum() > 0	CODE
CRITICAL⚡	tests/unit_tests/transformer/moe/test_routers.py	126	self.sequential_mlp.router.weight.grad.fill_(0)	CODE
CRITICAL⚡	tests/unit_tests/transformer/moe/test_routers.py	129	assert self.sequential_mlp.router.weight.grad.abs().sum() > 0	CODE
CRITICAL⚡	tests/unit_tests/models/test_gpt_model.py	87	assert self.gpt_model.embedding.word_embeddings.weight.std().cpu().item() == approx(	CODE
CRITICAL⚡	tests/unit_tests/models/test_gpt_model.py	90	assert self.gpt_model.embedding.word_embeddings.weight.mean().cpu().item() == approx(	CODE
CRITICAL	megatron/core/parallel_state.py	2169	and torch.distributed.distributed_c10d._world.pg_map.get(_DATA_PARALLEL_GROUP_GLOO, None)	CODE
CRITICAL	megatron/core/parallel_state.py	2178	and torch.distributed.distributed_c10d._world.pg_map.get(	CODE
CRITICAL	megatron/core/parallel_state.py	2217	and torch.distributed.distributed_c10d._world.pg_map.get(	CODE
CRITICAL	megatron/core/parallel_state.py	2231	and torch.distributed.distributed_c10d._world.pg_map.get(	CODE
CRITICAL	megatron/core/transformer/attention.py	1965	self.linear_qkv.weight.main_param.data.copy_(	CODE

Self-Referential Comments130 hits · 380 pts

Severity	File	Line	Snippet	Context
MEDIUM	gpt_builders.py	42	# Define the decoder block spec	COMMENT
MEDIUM	gpt_builders.py	54	# Define the decoder layer spec	COMMENT
MEDIUM	gpt_builders.py	67	# Define the decoder block spec	COMMENT
MEDIUM	.gitlab/stages/05.publish.yml	149	# Define the full refspec for the branch	COMMENT
MEDIUM	tools/run_inference_performance_test.py	113	# Create a list of valid token IDs	COMMENT
MEDIUM	tools/common_pile_dataset/setup_common_pile_dataset.sh	87	# Create a virtual environment to avoid system package conflicts	COMMENT
MEDIUM	tools/common_pile_dataset/setup_common_pile_dataset.sh	122	# Create the output directory	COMMENT
MEDIUM	tests/unit_tests/test_optimizer.py	1060	# Create a new state_dict with all params set to 3.	COMMENT
MEDIUM	tests/unit_tests/test_optimizer.py	1202	# Create a simple model for testing	COMMENT
MEDIUM	tests/unit_tests/test_optimizer.py	1284	# Create a simple model	COMMENT
MEDIUM⚡	tests/unit_tests/test_utils.py	509	# Create a straggler_detector with enabled set to false.	COMMENT
MEDIUM	tests/unit_tests/test_hyper_comm_grid.py	637	# Create a process group	COMMENT
MEDIUM	tests/unit_tests/test_hyper_comm_grid.py	640	# Create a tensor for communication test	COMMENT
MEDIUM	tests/unit_tests/test_hyper_comm_grid.py	741	# Create a unique tensor based on rank	COMMENT
MEDIUM	tests/unit_tests/test_emerging_optimizers.py	87	# Create a simple linear model for testing	COMMENT
MEDIUM	tests/unit_tests/test_emerging_optimizers.py	657	# Create a model with QKV-like parameter	COMMENT
MEDIUM	tests/unit_tests/test_training.py	189	# Create a mock state_dict with gradients (use deterministic values for reproducibility).	COMMENT
MEDIUM	tests/unit_tests/ssm/ops/test_ssm_kernel.py	83	# Create the Mixer instance directly	COMMENT
MEDIUM	tests/unit_tests/fusions/test_torch_softmax.py	197	# Create a padding mask	COMMENT
MEDIUM	…/unit_tests/post_training/test_modelopt_module_spec.py	249	# Define the expected signature	COMMENT
MEDIUM	…/unit_tests/post_training/test_modelopt_module_spec.py	288	# Define the expected signature	COMMENT
MEDIUM	…tests/distributed/test_torch_fully_sharded_parallel.py	56	# Create a dummy model and configs.	COMMENT
MEDIUM	…tests/distributed/test_torch_fully_sharded_parallel.py	63	# Create the sharded model.	COMMENT
MEDIUM⚡	…tests/distributed/test_torch_fully_sharded_parallel.py	84	# Create a dummy model and configs.	COMMENT
MEDIUM⚡	…tests/distributed/test_torch_fully_sharded_parallel.py	90	# Create a custom process group (using the default world for testing)	COMMENT
MEDIUM⚡	…tests/distributed/test_torch_fully_sharded_parallel.py	93	# Create the sharded model with explicit process group	COMMENT
MEDIUM	…ests/distributed/mfsdp_v1/test_mfsdp_uneven_dtensor.py	600	# Create a manual uneven sharding along dim 1 with possible zero-length local on some ranks	COMMENT
MEDIUM	…tests/transformer/test_transformer_block_custom_pgs.py	259	# Create a transformer block with default process groups	COMMENT
MEDIUM	…tests/transformer/test_transformer_block_custom_pgs.py	285	# Create a transformer block with custom process groups	COMMENT
MEDIUM	…tests/transformer/test_transformer_block_custom_pgs.py	737	# Create a single transformer block	COMMENT
MEDIUM	tests/unit_tests/transformer/test_cuda_graphs.py	777	# Create the CUDA graphs - this is where the is_last_layer logic is tested	COMMENT
MEDIUM	tests/unit_tests/transformer/test_cuda_graphs.py	1024	# Create a mapping of sample_keys to indices	COMMENT
MEDIUM⚡	tests/unit_tests/transformer/test_transformer_block.py	545	# Create a new build_layers method that uses interleaved attention	COMMENT
MEDIUM	…ts/unit_tests/transformer/moe/test_token_dispatcher.py	174	# Create the answer.	COMMENT
MEDIUM	tests/unit_tests/transformer/moe/test_shared_experts.py	386	# Create a dummy input tensor.	COMMENT
MEDIUM	tests/unit_tests/transformer/moe/test_aux_loss.py	267	# Create a new config with updated parameters	COMMENT
MEDIUM	tests/unit_tests/transformer/moe/test_aux_loss.py	270	# Create the router with the updated config	COMMENT
MEDIUM⚡	…s/unit_tests/training/config/test_instantiate_utils.py	334	# Create a mock that raises an error when used with functools.partial	COMMENT
MEDIUM	tests/unit_tests/training/config/test_yaml_utils.py	215	# Create a mock torch dtype	COMMENT
MEDIUM	tests/unit_tests/training/config/test_yaml_utils.py	262	# Create a mock GenerationConfig	COMMENT
MEDIUM	tests/unit_tests/utils/test_experimental_log_once.py	23	# Define a fresh function with the decorator so it has its own closure state.	COMMENT
MEDIUM	tests/unit_tests/models/test_gpt_model.py	237	# Define the expected signature	COMMENT
MEDIUM	tests/unit_tests/models/test_fastconformer_model.py	45	# Create a parameter with the target dtype so ``next(self.parameters()).dtype``	COMMENT
MEDIUM	…s/unit_tests/models/mimo/test_mimo_audio_submodules.py	212	# Create a time array	COMMENT
MEDIUM	…s/unit_tests/models/mimo/test_mimo_audio_submodules.py	215	# Create a simple sine wave at 440 Hz (A4)	COMMENT
MEDIUM	…nit_tests/models/mimo/test_mimo_embedding_alignment.py	22	# Create a minimal MimoModelConfig	COMMENT
MEDIUM	…nit_tests/models/mimo/test_mimo_embedding_alignment.py	73	# Create a simple batch	COMMENT
MEDIUM	…nit_tests/models/mimo/test_mimo_embedding_alignment.py	249	# Create a test case with 2 batches:	COMMENT
MEDIUM	…nit_tests/models/mimo/test_mimo_embedding_alignment.py	285	# Create the unflattened embeddings that would come from a vision encoder	COMMENT
MEDIUM	tests/unit_tests/models/mimo/test_mimo_submodules.py	92	# Create the main module spec	COMMENT
MEDIUM	tests/unit_tests/models/mimo/test_mimo_submodules.py	351	# Create a data batch without images	COMMENT
MEDIUM	tests/unit_tests/dist_checkpointing/test_fp8.py	33	# Create a quantizer for FP8 conversion	COMMENT
MEDIUM	…s/unit_tests/dist_checkpointing/test_fully_parallel.py	596	# Create a mock that will do what it's supposed to do,	COMMENT
MEDIUM	…/unit_tests/inference/contexts/test_dynamic_context.py	318	# Initialize all variables	COMMENT
MEDIUM	…/unit_tests/inference/contexts/test_dynamic_context.py	1092	# Create an active_requests_mask where requests 0, 2, and 4 are finished (0),	COMMENT
MEDIUM	…/unit_tests/inference/contexts/test_dynamic_context.py	1182	# Create an active_requests_mask where all requests are finished	COMMENT
MEDIUM⚡	…ts/unit_tests/inference/engines/test_dynamic_engine.py	2157	# Create a deterministic mock forward pass that returns logits	COMMENT
MEDIUM	…ts/unit_tests/inference/engines/test_dynamic_engine.py	1727	# Create a request with length 513	COMMENT
MEDIUM	tests/unit_tests/data/test_builder.py	94	# Define the class here to avoid pytest warnings	COMMENT
MEDIUM	tests/unit_tests/rl/test_rl_utils.py	763	# Create a realistic GPTModel as used in RL training	COMMENT
70 more matches not shown…

Excessive Try-Catch Wrapping212 hits · 264 pts

Severity	File	Line	Snippet	Context
LOW	tasks/finetune_utils.py	61	except Exception:	CODE
LOW	tasks/eval_utils.py	114	except Exception:	CODE
MEDIUM	tasks/eval_utils.py	111	def correct_answers_forward_step(batch, model):	CODE
LOW	.gitlab/scripts/check_imports.py	117	except Exception:	CODE
LOW⚡	tools/check_copyright.py	22	except Exception as e:	CODE
MEDIUM⚡	tools/check_copyright.py	23	print(f"Error reading {file_path}: {e}")	CODE
MEDIUM	tools/text_generation_cli.py	21	print(f"Error {response.status_code}: {response.json()['message']}")	CODE
LOW	tools/checkpoint/saver_hf_llava.py	400	except Exception as e:	CODE
LOW	tools/checkpoint/loader_base.py	78	except Exception as e:	CODE
MEDIUM	tools/checkpoint/loader_base.py	79	print(f"Error validating Megatron arguments: {e}")	CODE
LOW	tools/checkpoint/saver_core.py	70	except Exception as e:	CODE
LOW	tools/checkpoint/gpt_hybrid_conversion.py	384	except Exception:	CODE
LOW	tools/checkpoint/dist_checkpoint_io.py	215	except Exception:	CODE
LOW	tools/checkpoint/hybrid_conversion.py	322	except Exception:	CODE
LOW	tools/checkpoint/loader_mixtral_hf.py	340	except Exception:	CODE
MEDIUM	tools/checkpoint/loader_mixtral_hf.py	337	def load_checkpoint(queue, args):	CODE
LOW	tools/checkpoint/saver_llava.py	404	except Exception as e:	CODE
LOW	tools/checkpoint/loader_core.py	90	except Exception as e:	CODE
LOW	tools/checkpoint/loader_llava.py	360	except Exception as e:	CODE
LOW	tools/bert_embedding/embed.py	217	except Exception:	CODE
LOW	tools/bert_embedding/embed.py	223	except Exception:	CODE
LOW	tests/unit_tests/conftest.py	48	except Exception:	CODE
LOW	tests/unit_tests/conftest.py	96	except Exception as e:	CODE
LOW	tests/unit_tests/test_hyper_comm_grid.py	531	except Exception as e:	CODE
LOW	tests/unit_tests/test_api_backwards_compat_setup.py	106	except Exception as e:	CODE
LOW	tests/unit_tests/test_api_backwards_compat_setup.py	134	except Exception as e:	STRING
LOW	tests/unit_tests/test_imports.py	58	except Exception:	CODE
LOW	tests/unit_tests/test_imports.py	77	except Exception:	CODE
LOW	tests/unit_tests/test_utilities.py	114	except Exception:	CODE
LOW⚡	tests/unit_tests/tokenizers/test_tokenizer.py	650	except Exception:	CODE
LOW	tests/unit_tests/tokenizers/test_tokenizer.py	19	except Exception:	CODE
LOW	tests/unit_tests/tokenizers/test_tokenizer.py	152	except Exception:	CODE
LOW	tests/unit_tests/tokenizers/test_tokenizer.py	178	except Exception:	CODE
LOW	tests/unit_tests/tokenizers/test_tokenizer.py	320	except Exception:	CODE
LOW	…ted/mfsdp_v1/test_mcore_fully_sharded_data_parallel.py	1142	except Exception:	CODE
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	74	except Exception as e:	CODE
LOW	tests/unit_tests/transformer/moe/test_aux_loss.py	38	except Exception: # pragma: no cover - defensive	CODE
LOW	tests/unit_tests/transformer/moe/test_routers.py	25	except Exception: # pragma: no cover - defensive	CODE
LOW	tests/unit_tests/models/test_hybrid_moe_model.py	427	except Exception:	CODE
LOW	…s/unit_tests/models/mimo/test_mimo_audio_submodules.py	123	except Exception as e:	CODE
LOW	…s/unit_tests/models/mimo/test_mimo_audio_submodules.py	197	except Exception as e:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_submodules.py	55	except Exception as e:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_submodules.py	105	except Exception as e:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_submodules.py	229	except Exception as e:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_submodules.py	284	except Exception as e:	CODE
MEDIUM	tests/unit_tests/models/mimo/test_mimo_model.py	148	def setup_method(self, method):	CODE
MEDIUM	tests/unit_tests/models/mimo/test_mimo_model.py	164	def teardown_method(self, method):	CODE
MEDIUM	tests/unit_tests/models/mimo/test_mimo_model.py	609	def setup_method(self, method):	CODE
MEDIUM	tests/unit_tests/models/mimo/test_mimo_model.py	623	def teardown_method(self, method):	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_model.py	151	except Exception:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_model.py	167	except Exception:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_model.py	612	except Exception:	CODE
LOW	tests/unit_tests/models/mimo/test_mimo_model.py	626	except Exception:	CODE
LOW	tests/unit_tests/dist_checkpointing/test_fp8.py	24	except Exception as e:	CODE
LOW	…sts/unit_tests/dist_checkpointing/test_safe_globals.py	166	except Exception as exc:	CODE
MEDIUM	…sts/unit_tests/dist_checkpointing/test_safe_globals.py	163	def load(i):	CODE
LOW	tests/unit_tests/dist_checkpointing/test_async_save.py	30	except Exception as e:	CODE
LOW	tests/unit_tests/resharding/test_model_swap.py	34	except Exception:	CODE
LOW	tests/unit_tests/resharding/test_model_swap.py	44	except Exception:	CODE
LOW	tests/unit_tests/resharding/test_model_swap.py	365	except Exception:	CODE
152 more matches not shown…

Modern Structural Boilerplate195 hits · 196 pts

Severity	File	Line	Snippet	Context
LOW	tools/trigger_internal_ci.py	48	logger = logging.getLogger(__name__)	CODE
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	33	logger = logging.getLogger(__name__)	CODE
LOW	…sts/unit_tests/distributed/mfsdp_v1/test_annotation.py	45	def _setup_nvtx_recording(monkeypatch: pytest.MonkeyPatch, events: list[NvtxEvent]) -> None:	CODE
LOW	…ts/unit_tests/distributed/mfsdp_v2/test_fully_shard.py	22	logger = logging.getLogger(__name__)	CODE
LOW	…sts/unit_tests/distributed/mfsdp_v2/test_cuda_graph.py	18	logger = logging.getLogger(__name__)	CODE
LOW⚡	tests/unit_tests/models/mimo/test_mimo_1f1b_schedule.py	53	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/recipe_parser.py	13	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/check_status_of_main.py	18	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/launch_jet_workload.py	30	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/launch_jet_workload.py	33	logger = logging.getLogger(__name__)	CODE
LOW	…/test_utils/python_scripts/launch_nemo_run_workload.py	16	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/notify.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…st_utils/python_scripts/download_unit_tests_dataset.py	18	logger = logging.getLogger(__name__)	CODE
LOW	…/test_utils/python_scripts/compare_golden_values_kl.py	63	logger = logging.getLogger(__name__)	CODE
LOW	…test_utils/python_scripts/download_coverage_results.py	15	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/auto_reminder_github.py	23	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/wait_for_resources.py	19	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/dashboard.py	15	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/approve_merge_gate.py	27	logger = logging.getLogger(__name__)	CODE
LOW	tests/test_utils/python_scripts/swap_pr_labels.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…ts/test_utils/python_scripts/download_golden_values.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…l_tests/python_test_utils/compute_golden_statistics.py	43	logger = logging.getLogger(__name__)	CODE
LOW	…t_utils/test_pretraining_resume_checkpoint_pipeline.py	11	logger = logging.getLogger(__name__)	CODE
LOW	…s/python_test_utils/test_inference_regular_pipeline.py	14	logger = logging.getLogger(__name__)	CODE
LOW	…python_test_utils/test_pretraining_regular_pipeline.py	11	logger = logging.getLogger(__name__)	CODE
LOW	tests/functional_tests/python_test_utils/common.py	20	logger = logging.getLogger(__name__)	CODE
LOW	…nal_tests/python_test_utils/test_grpo_training_loop.py	11	logger = logging.getLogger(__name__)	CODE
LOW	…n_test_utils/get_test_results_from_tensorboard_logs.py	13	logger = logging.getLogger(__name__)	CODE
LOW	examples/mimo/training/distributed.py	15	__all__ = ["initialize_distributed", "print_rank_0", "shutdown_distributed"]	CODE
LOW	scripts/check_api_backwards_compatibility.py	26	logger = logging.getLogger(__name__)	CODE
LOW	megatron/post_training/checkpointing.py	22	logger = logging.getLogger(__name__)	CODE
LOW⚡	megatron/core/fault_injector.py	31	__all__ = ["FaultInjectorConfig", "setup_fault_injection", "maybe_raise_workload_exception"]	CODE
LOW⚡	megatron/core/fault_injector.py	42	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/parallel_state.py	19	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/full_cuda_graph.py	12	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/__init__.py	30	__all__ = [	CODE
LOW	megatron/core/msc_utils.py	6	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/msc_utils.py	67	__all__ = ['MultiStorageClientFeature', 'open_file']	CODE
LOW	megatron/core/optimizer_param_scheduler.py	16	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/nccl_allocator.py	15	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/utils.py	54	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/config_logger.py	126	__all__ = ['has_config_logger_enabled', 'log_config_to_disk']	CODE
LOW	megatron/core/num_microbatches_calculator.py	9	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/timers.py	32	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/rerun_state_machine.py	32	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/rerun_state_machine.py	243	def set_mode(self, mode: RerunMode) -> None:	CODE
LOW	megatron/core/rerun_state_machine.py	1393	def _set_rerun_state_machine(rerun_state_machine) -> None:	CODE
LOW	megatron/core/ssm/mamba_mixer.py	97	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/ssm/gated_delta_net.py	56	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/ssm/ops/__init__.py	14	__all__ = ["mamba_chunk_scan_combined_varlen", "causal_conv1d_varlen_fn"]	CODE
LOW	megatron/core/tokenizers/megatron_tokenizer.py	34	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/tokenizers/utils/build_tokenizer.py	23	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/tokenizers/text/parsers/__init__.py	12	__all__ = ["PARSER_MAPPING"]	CODE
LOW	…ore/tokenizers/text/parsers/qwen3_coder_tool_parser.py	14	logger = logging.getLogger(__name__)	CODE
LOW	…ore/tokenizers/text/libraries/huggingface_tokenizer.py	17	logger = logging.getLogger(__name__)	CODE
LOW	…n/core/tokenizers/text/libraries/tiktoken_tokenizer.py	18	logger = logging.getLogger(__name__)	CODE
LOW	…n/core/tokenizers/text/libraries/tiktoken_tokenizer.py	28	__all__ = ["TikTokenTokenizer", "reload_mergeable_ranks"]	CODE
LOW	…ore/tokenizers/text/libraries/megatron_hf_tokenizer.py	20	logger = logging.getLogger(__name__)	CODE
LOW	megatron/core/tensor_parallel/__init__.py	49	__all__ = [	CODE
LOW	megatron/core/post_training/modelopt/layers.py	17	logger = logging.getLogger(__name__)	CODE
135 more matches not shown…

Redundant / Tautological Comments127 hits · 194 pts

Severity	File	Line	Snippet	Context
LOW	train_rl.py	62	# Check if fp8_model_init supports preserve_high_precision_init_val	COMMENT
LOW⚡	tools/check_copyright.py	20	# Check if the expected header is at the start of the file	COMMENT
LOW	tools/common_pile_dataset/setup_common_pile_dataset.sh	50	# Check if create_common_pile_ci_dataset.py was scp'd alongside this script	COMMENT
LOW	docker/common/install_source_wheels.sh	25	# Check if required arguments are provided	COMMENT
LOW	tests/unit_tests/test_optimizer.py	45	# Check if FP8 block scaling is available.	COMMENT
LOW⚡	tests/unit_tests/test_utils.py	512	# Check if configuration was success.	COMMENT
LOW⚡	tests/unit_tests/test_utils.py	515	# Check if the instance is in disabled state.	COMMENT
LOW⚡	tests/unit_tests/test_utils.py	519	# Check if all ranks have straggler detector enabled.	COMMENT
LOW	tests/unit_tests/conftest.py	83	# Check if data directory exists and has content	COMMENT
LOW	tests/unit_tests/fusions/test_torch_softmax.py	145	# Check if output is a valid probability distribution	COMMENT
LOW	tests/unit_tests/transformer/test_cuda_graphs.py	572	# Check if cuda graph is correctly setting is first/last layer	COMMENT
LOW	tests/unit_tests/transformer/test_attention.py	416	# Check if output and bias have the correct shape	COMMENT
LOW	tests/unit_tests/transformer/test_attention.py	598	# Check if the output is close	COMMENT
LOW	…/unit_tests/transformer/test_multi_latent_attention.py	1552	# Check if the output is the same	COMMENT
LOW	…it_tests/transformer/moe/test_moe_layer_discrepancy.py	69	# Check if parameters are the same	COMMENT
LOW	…it_tests/transformer/moe/test_moe_layer_discrepancy.py	75	# Check if input is the same across all ranks	COMMENT
LOW	…it_tests/transformer/moe/test_moe_layer_discrepancy.py	150	# Check if output is the same across all ranks	COMMENT
LOW	…it_tests/transformer/moe/test_moe_layer_discrepancy.py	216	# Check if output is the same across all ranks	COMMENT
LOW	tests/unit_tests/transformer/moe/test_moe_layer.py	164	# Check if the moe layer is interleaved correctly	COMMENT
LOW	tests/unit_tests/models/test_llava_model.py	753	# Check if output shape is as expected	COMMENT
LOW	…_tests/dist_checkpointing/test_layer_wise_optimizer.py	147	# Check if optimizer is ChainedOptimizer (expected for standard setup)	COMMENT
LOW	…/unit_tests/inference/contexts/test_dynamic_context.py	1075	# Assign blocks to the requests (one block per request)	COMMENT
LOW	…/unit_tests/inference/contexts/test_dynamic_context.py	1151	# Assign blocks to the requests:	COMMENT
LOW	…ts/unit_tests/inference/engines/test_dynamic_engine.py	3446	# Check if any request was evicted during this step	COMMENT
LOW	…ts/test_utils/python_scripts/download_golden_values.py	282	# Check if we should skip based on only_failing flag	COMMENT
LOW	…nal_tests/test_cases/common/ckpt_converter/__main__.py	895	# Print results.	COMMENT
LOW	…unctional_tests/shell_test_utils/run_batch_ci_tests.sh	173	# Check if file is empty (job still running or not started)	COMMENT
LOW	tests/functional_tests/shell_test_utils/run_ci_test.sh	11	# Set umask to 0002 to allow group read/write permissions	COMMENT
LOW	tests/functional_tests/shell_test_utils/run_ci_test.sh	267	## Loop over the list of model configs in the params file and run each one in sequence, collecting	COMMENT
LOW	docs/conf.py	43	# Check if we should skip autodoc generation	COMMENT
LOW	examples/post_training/modelopt/finetune.py	251	# Check if this is OpenAI chat data?	COMMENT
LOW	examples/mimo/utils/logging.py	40	# Print output projections	COMMENT
LOW	examples/gptoss/02_train.sh	55	# Check if checkpoint path exists	COMMENT
LOW	examples/gptoss/02_train.sh	62	# Check if tensorboard logs path exists	COMMENT
LOW	examples/multimodal/dataset_helpers.py	121	# Check if all samples fit in the knapsack capacity.	COMMENT
LOW	examples/inference/utils.py	235	# Check if we have any prompts (from command line or JSONL)	COMMENT
LOW	examples/inference/advanced/gpt_dynamic_inference.py	263	# Check if all requests are finished.	COMMENT
LOW	examples/rl/benchmark_refit.py	216	# Print results	COMMENT
LOW	examples/rl/benchmark_refit.py	312	# Print results	COMMENT
LOW	examples/rl/environments/countdown/countdown.py	34	# Check if all numbers in equation are available	COMMENT
LOW	scripts/check_api_backwards_compatibility.py	240	# Check if this breakage kind should be ignored globally (not a signature change)	COMMENT
LOW	scripts/check_api_backwards_compatibility.py	253	# Check if this is a breakage kind we ignore for __init__ methods	COMMENT
LOW	scripts/check_api_backwards_compatibility.py	263	# Check if it's a child of a filtered object	COMMENT
LOW	scripts/check_api_backwards_compatibility.py	350	# Print results	COMMENT
LOW	.github/actions/check-nvidia-sso-membership/action.yml	94	# Check if SSO file is available	COMMENT
LOW	.github/actions/check-nvidia-sso-membership/action.yml	103	# Check if username exists as a key in the JSON object	COMMENT
LOW	megatron/post_training/model_builder.py	286	# Set num_layers to 0 for base model in offline mode	COMMENT
LOW	megatron/core/parallel_state.py	836	# Set NCCL_COLLNET_ENABLE to 1 to enable SHARP for the dp group.	COMMENT
LOW	megatron/core/parallel_state.py	1275	# Set NCCL_COLLNET_ENABLE to 1 to enable SHARP for the dp_replica group.	COMMENT
LOW	megatron/core/parallel_state.py	1304	# Set NCCL_COLLNET_ENABLE to 0 to restrict SHARP application to the dp_replica group.	COMMENT
LOW	megatron/core/fp4_utils.py	14	# Check if Transformer Engine is installed	COMMENT
LOW	megatron/core/fp4_utils.py	25	# Check if Transformer Engine has class for fp4 tensors.	COMMENT
LOW	megatron/core/fp8_utils.py	23	# Check if Transformer Engine is installed	COMMENT
LOW	megatron/core/fp8_utils.py	40	# Check if Transformer Engine has class for fp8 tensors.	COMMENT
LOW	megatron/core/fp8_utils.py	55	# Check if Transformer Engine has MXFP8Tensor class	COMMENT
LOW	megatron/core/fp8_utils.py	653	# Check if fp8_model_init supports setting recipe	COMMENT
LOW	megatron/core/fp8_utils.py	658	# Check if fp8_model_init supports preserve_high_precision_init_val	COMMENT
LOW	megatron/core/utils.py	3010	# Check if any deprecated key is present in kwargs	COMMENT
LOW	megatron/core/ssm/ops/causal_conv1d_triton.py	243	# Check if input is 2D, temporarily treat as 3D for uniform processing	COMMENT
LOW	megatron/core/tokenizers/megatron_tokenizer.py	67	# Check if metadata file exists	COMMENT
67 more matches not shown…

Docstring Block Structure37 hits · 185 pts

Severity	File	Line	Snippet	Context
HIGH	tools/checkpoint/remap_gpt_dsa_to_mamba.py	46	Return the HybridModel state-dict key corresponding to key from GPTModel. Args: key: A key from the GPTMo	STRING
HIGH	…ted/mfsdp_v1/test_mcore_fully_sharded_data_parallel.py	756	Run a small deterministic training loop using a mocked hybrid Mamba+MoE model and optimizer. This helpe	STRING
HIGH	tests/test_utils/python_scripts/launch_jet_workload.py	105	Block until a downstream JET pipeline reaches a terminal status. jetclient's own ``pipeline.wait`` polls the GitLab	STRING
HIGH	megatron/core/process_groups_config.py	668	Get process group collection for a specific module. Args: module_name: Name of the module.	STRING
HIGH	megatron/core/hyper_comm_grid.py	209	Create a process group based on a list of dimension names Note: The unique key used to store the process group	STRING
HIGH	megatron/core/utils.py	120	Validates the request to the experimental function. Args: func (Callable): Callee max_l	STRING
HIGH	megatron/core/utils.py	186	Validates the request to the experimental function. Args: func (Callable): Callee max_l	STRING
HIGH	megatron/core/utils.py	218	Pass-through to callee attribute if experimental flag is enabled. Args: super (supe	STRING
HIGH	megatron/core/timers.py	385	Returns the output string with logged timer values according to configured options. Args: names (Li	STRING
HIGH	…ore/tokenizers/text/libraries/huggingface_tokenizer.py	189	Adds a dictionary of special tokens (eos, pad, cls...). If special tokens are NOT in the vocabulary, th	STRING
HIGH	megatron/core/fusions/fused_bias_swiglu.py	210	Implementation of biased SwiGLU that handles different input shapes. This function reshapes the input if necessary,	STRING
HIGH	megatron/core/fusions/fused_bias_geglu.py	154	Implementation of biased GEGLU that handles different input shapes. This function reshapes the input if necessary,	STRING
HIGH	megatron/core/optimizer/optimizer.py	538	Filter and reorder state_dict parameter groups to match current optimizer groups. Keys used for matching align w	STRING
HIGH	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	4605	Release the specified parameter bucket, freeing its associated buffer storage. This function marks or	STRING
HIGH	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	5016	Creates a distributed tensor (DTensor) from a local tensor with support for Megatron-FSDP and Tensor Parallel s	STRING
HIGH	…e/distributed/fsdp/src/megatron_fsdp/uneven_dtensor.py	140	Validates the chunk metadata of an uneven DTensor to ensure correctness and boundary coverage. Notes: - `g	STRING
HIGH	…e/distributed/fsdp/src/megatron_fsdp/uneven_dtensor.py	257	Gather a DTensor with potentially uneven sharding across ranks into a full tensor. This function handles DTens	STRING
HIGH	…core/distributed/fsdp/src/megatron_fsdp/fully_shard.py	455	Fully shard the optimizer for Megatron-FSDP. This is an in-place operation on the optimizer instance, which mod	STRING
HIGH	…tron/core/datasets/blended_megatron_dataset_builder.py	497	Build the DistributedDataset Return None if and only if the underlying dataset class is not built on the curren	STRING
HIGH	megatron/core/datasets/object_storage_utils.py	136	Ascertain whether the object at the given S3 path exists in S3 Args: client (S3Client): The S3 client	STRING
HIGH	megatron/core/datasets/indexed_dataset.py	88	Get the size of the dtype/code in bytes Args: key (Union[int, Type[numpy.number]]): The dtype or co	STRING
HIGH	megatron/core/datasets/indexed_dataset.py	798	Return from the dataset Args: idx (Union[int, numpy.integer, slice]): The index or index slice into	STRING
HIGH	…ron/core/pipeline_parallel/multimodule_communicator.py	496	Compute the total number of pipeline stages across a multi-module chain. Interprets ``topology`` as a directed	STRING
HIGH	megatron/core/models/hybrid/hybrid_layer_allocation.py	161	Count layers by type across the full hybrid pattern (main + MTP). Parses the pattern to extract main and MTP compon	STRING
HIGH	megatron/core/models/hybrid/hybrid_layer_allocation.py	200	Parse a unified hybrid pattern string into main and MTP components. The pattern uses "/" as a separator between the	STRING
HIGH	megatron/core/models/hybrid/hybrid_layer_allocation.py	301	Validate and convert a single pipeline segment pattern to a layer type list. This is used after the main pattern ha	STRING
HIGH	megatron/core/models/hybrid/hybrid_layer_allocation.py	339	Select and validate the pipeline segment for the given PP rank and VP stage. When the main pattern contains '\|' pip	STRING
HIGH	megatron/core/dist_checkpointing/validation.py	294	Raises or logs an error in case missing or unexpected keys are non-empty. Args: missing_keys (Set[str]): mi	STRING
HIGH	megatron/core/dist_checkpointing/validation.py	372	Validate if the ShardedTensors and ShardedObjects from multiple processes define correct sharding. Local ShardedTen	STRING
HIGH	…tron/core/dist_checkpointing/strategies/async_utils.py	669	Finalizes all available calls. This method must be called on all ranks. Args: blocking (bo	STRING
HIGH	…ng/nvshmem_copy_service/memory/tensor_pointer_utils.py	19	Extract the data pointer from a tensor. Args: tensor: Can be torch.Tensor, CuPy array, or	STRING
HIGH	megatron/core/inference/shards_spec.py	101	Parse + validate the ``--inference-shards`` string. Args: spec_str: Raw CLI value, e.g. ``"tp=2,dp=1+tp=1,d	STRING
HIGH	megatron/core/export/trtllm/trtllm_layers.py	85	Helper function to rename model layer names to TRTLLM Layer names We go through each layer (keys) in the model	STRING
HIGH	megatron/training/arguments.py	172	Validate model config arguments from heterogeneous config. This function takes model arguments and validates them b	STRING
HIGH	megatron/training/config/instantiate_utils.py	128	Instantiate an object or callable from a config object. This function takes a configuration object (dictionary, lis	STRING
HIGH	megatron/training/config/instantiate_utils.py	232	Recursively instantiates a node within a configuration structure. This function handles the instantiation of indivi	STRING
HIGH	megatron/rl/rl_utils.py	1291	Pad trajectories and extract the generation masks. Args: rollouts: Rollouts to extract trajectories from.	STRING

Structural Annotation Overuse73 hits · 132 pts

Severity	File	Line	Snippet	Context
LOW⚡	tools/checkpoint/saver_hf_llava.py	17	# Step 1: Reshape back to (num_head, 3*head_dim, -1)	COMMENT
LOW⚡	tools/checkpoint/saver_hf_llava.py	20	# Step 2: Slice along the head_dim dimension to get q, k, v	COMMENT
LOW⚡	tools/checkpoint/saver_hf_llava.py	25	# Step 3: Reshape each back to (num_head * head_dim, -1)	COMMENT
LOW	tools/common_pile_dataset/README.md	113	### Step 1: Vocabulary Files	COMMENT
LOW	tools/common_pile_dataset/README.md	119	### Step 2: Download Raw Text	COMMENT
LOW	tools/common_pile_dataset/README.md	132	### Step 3: GPT Preprocessing	COMMENT
LOW	tools/common_pile_dataset/README.md	137	### Step 4: BERT Preprocessing	COMMENT
LOW	tools/common_pile_dataset/README.md	150	### Step 5: T5 Preprocessing	COMMENT
LOW	tools/common_pile_dataset/README.md	155	### Step 6: Verification	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	361	# Step 1: Get vocabulary files	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	387	# Step 2: Get raw text data	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	411	# Step 3: Preprocess for GPT (GPT2BPETokenizer)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	434	# Step 4: Preprocess for BERT (BertWordPieceLowerCase + split-sentences)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	475	# Step 5: Preprocess for T5 (BertWordPieceCase)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	497	# Step 6: Clean up and verify	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2038	# Step 1: Forward pass for all 3 requests	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2047	# Step 2: Forward pass where req 10 finishes, req 11 continues. Req 999 is NOT scheduled.	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2136	# Step 1: All 3 requests are active, process forward pass	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2145	# Step 2: Both decode requests finish, chunked prefill NOT scheduled this step.	COMMENT
LOW	…/unit_tests/inference/contexts/test_dynamic_context.py	2069	# Step 3: Add the next chunk. It should sit exactly at the boundary (index 1) and inherit the state.	COMMENT
LOW⚡	…ts/unit_tests/inference/engines/test_dynamic_engine.py	3657	# Step 1: prefill for request 0 — should NOT count as a spec step.	COMMENT
LOW⚡	…ts/unit_tests/inference/engines/test_dynamic_engine.py	3667	# Step 2: decode for request 0 — should count spec tokens.	COMMENT
LOW	…ts/unit_tests/inference/engines/test_dynamic_engine.py	2753	# Step 1: Prefill. Processes the 4 prompt tokens.	COMMENT
LOW⚡	…l_tests/python_test_utils/compute_golden_statistics.py	11	# Step 1: Run batch tests (from megatron-rl directory):	STRING
LOW⚡	…l_tests/python_test_utils/compute_golden_statistics.py	15	# Step 2: Wait for jobs to complete, then compute statistics:	STRING
LOW⚡	docs/developer/submit.md	15	## Step 1: Mark PR as "Ready for Review"	COMMENT
LOW⚡	docs/developer/submit.md	24	## Step 2: Final Review (`megatron/core` only)	COMMENT
LOW⚡	docs/developer/submit.md	32	## Step 3: Approved	COMMENT
LOW⚡	docs/developer/submit.md	37	## Step 4: Merge	COMMENT
LOW	docs/user-guide/data-loading.md	39	### Step 1: Consolidate dataset files	COMMENT
LOW	docs/user-guide/data-loading.md	53	### Step 2: Pre-build the dataset cache	COMMENT
LOW	docs/user-guide/data-loading.md	77	### Step 3: Optionally pre-build per-dataset metadata	COMMENT
LOW	docs/user-guide/data-loading.md	87	### Step 4: Launch training with optimized data loading	COMMENT
LOW	examples/gptoss/README.md	3	## Step 0: Install Dependencies	COMMENT
LOW	examples/gptoss/README.md	57	## Step 1: Convert HuggingFace to Megatron (Optional - skip if you already have a Megatron checkpoint)	COMMENT
LOW	examples/gptoss/README.md	65	## Step 2: Train from Scratch	COMMENT
LOW	examples/gptoss/README.md	147	## Step 3: Convert Megatron to HuggingFace	COMMENT
LOW⚡	.github/pull_request_template.md	33	#### Step 1: Mark PR as "Ready for Review"	COMMENT
LOW⚡	.github/pull_request_template.md	42	#### Step 2: Final Review	COMMENT
LOW⚡	.github/pull_request_template.md	48	#### Step 3: Approved	COMMENT
LOW⚡	.github/scripts/oncall_manager.py	135	# Step 1: Add new oncall first (include current members to avoid removing anyone yet)	COMMENT
LOW⚡	.github/scripts/oncall_manager.py	142	# Step 2: Now set the usergroup to contain only the new oncall	COMMENT
LOW⚡	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1871	# Step 3: Split parameter groups to meet communication segmentation requirements.	COMMENT
LOW⚡	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1916	# Step 4: Generate the groups of collective buckets, where each group aggregates	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1751	# Step 0: Register new FSDP unit modules.	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1784	# Step 1: Group the parameters according to their execution order and attributes.	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1822	# Step 2: Bucket the parameters based on the guide bucket size.	COMMENT
LOW	megatron/core/transformer/moe/README.md	152	### Step 1: Find the feasible parallel mapping under the memory capacity of the GPU	COMMENT
LOW	megatron/core/transformer/moe/README.md	170	### Step 2: Select Optimal Parallelism Strategy	COMMENT
LOW	megatron/core/transformer/moe/README.md	226	### Step 3: Enable Performance Features Based on Profiling Bottlenecks	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	265	# Step 1: Find maximum value in the row for numerical stability	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	277	# Step 2: Compute sum of exp(x - max_val)	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	293	# Step 3: Compute final log_softmax values: x - max_val - log_sum_exp	COMMENT
LOW⚡	megatron/core/pipeline_parallel/bridge_communicator.py	437	# Step 1: broadcast its shape so receivers can allocate	COMMENT
LOW⚡	megatron/core/pipeline_parallel/bridge_communicator.py	443	# Step 2: broadcast the actual tensor	COMMENT
LOW	…n/core/dist_checkpointing/strategies/fully_parallel.py	238	# Step 3: load part of the checkpoint.	COMMENT
LOW	…n/core/dist_checkpointing/strategies/fully_parallel.py	264	# Step 4: exchange data between ranks	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	232	# Step 1: Segment tasks (break large tasks into chunks)	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	242	# Step 2: Pack tasks into workload groups	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	249	# Step 3: Schedule workloads to iterations	COMMENT
13 more matches not shown…

Over-Commented Block142 hits · 128 pts

Severity	File	Line	Snippet	Context
LOW	.gitlab/scripts/check_imports.py	1	# Copyright (c) 2025, NVIDIA CORPORATION.	COMMENT
LOW	tools/bisect.sh	1	#!/usr/bin/env bash	COMMENT
LOW	tools/trigger_internal_ci.py	1	#!/usr/bin/env python3	COMMENT
LOW	tools/checkpoint/convert.py	1	# Copyright (c) 2023, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	tools/checkpoint/convert.py	21	# full model weights, nothing split.	COMMENT
LOW	tools/checkpoint/convert.py	41	# consumed_train_samples	COMMENT
LOW	tools/checkpoint/convert.py	61	# "mlp l0 bias"	COMMENT
LOW	tools/checkpoint/gpt_hybrid_conversion.py	221	else:	COMMENT
LOW	tools/checkpoint/gpt_hybrid_conversion.py	241	#	COMMENT
LOW	tools/common_pile_dataset/setup_common_pile_dataset.sh	1	#!/bin/bash	COMMENT
LOW	…t_cases/gpt/gpt_583m_perf_gb200_4gpu/model_config.yaml	1	# Inference perf test: 583M mcore-mistral checkpoint, TP=1 PP=1 DP=4 (4 GPUs).	COMMENT
LOW	…rid/hybrid_nanov3_3b_perf_gb200_4gpu/model_config.yaml	1	# Inference perf test: nanov3 3B hybrid MoE checkpoint, TP=1 PP=1 EP=4 (4 GPUs).	COMMENT
LOW	…ts/performance_tests/shell_test_utils/run_perf_test.sh	1	#!/usr/bin/env bash	COMMENT
LOW	…ts/performance_tests/shell_test_utils/run_perf_test.sh	121	# Override TP/PP/EP from config (the args file ships defaults; config wins).	COMMENT
LOW	tests/unit_tests/test_imports.py	1	# Copyright (c) 2022, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	…s/checkpoint/test_gpt_hybrid_conversion_parallelism.py	41	# tiny synthetic DCP checkpoint and round-trips it through the converter on	COMMENT
LOW	tests/unit_tests/determinism/configs.py	121	# mamba-attn-mlp covers Mamba + attention + MLP paths. pure-mamba is	COMMENT
LOW	tests/unit_tests/determinism/configs.py	181	"TP": "tensor_model_parallel_size",	COMMENT
LOW	…unit_tests/transformer/test_fsdp_dtensor_checkpoint.py	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	…s/unit_tests/elastification/test_hybrid_flex_router.py	81	tensor_model_parallel_size=1, pipeline_model_parallel_size=1	COMMENT
LOW	…s/unit_tests/pipeline_parallel/test_pipeline_layout.py	361		COMMENT
LOW	…s/unit_tests/dist_checkpointing/test_fully_parallel.py	521	# main replica is in DP group with ranks [0, 2, 4, 6],	COMMENT
LOW	tests/unit_tests/inference/test_hybrid_moe.py	61	# Combinatorial sweep: unordered combinations with repetition of ALL_STATES	COMMENT
LOW	tests/test_utils/python_scripts/recipe_parser.py	21	# cadence remains the trigger axis.	COMMENT
LOW	tests/test_utils/python_scripts/test_oncall_manager.py	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	tests/test_utils/recipes/h100/moe.yaml	101	- environment: [lts]	COMMENT
LOW	tests/test_utils/recipes/h100/moe.yaml	221	# Super important mr, mr-github tests that run for both DEV and LTS per mr, mr-github #	COMMENT
LOW	tests/test_utils/recipes/h100/gpt.yaml	461	# - test_case: [gpt3_mcore_te_tp2_pp1_fsdp2_resume_torch_dist]	COMMENT
LOW	tests/test_utils/recipes/h100/gpt.yaml	521	platforms: [dgx_h100]	COMMENT
LOW	tests/test_utils/recipes/gb200/moe.yaml	101	platforms: [dgx_gb200]	COMMENT
LOW	tests/test_utils/recipes/gb200/moe.yaml	201	# - test_case: [gpt3_mcore_te_tp2_pp1_frozen_resume_torch_dist_te_8experts2parallel_dist_optimizer]	COMMENT
LOW	…unctional_tests/test_cases/common/moe_perf/__main__.py	401		COMMENT
LOW	…unctional_tests/test_cases/common/moe_perf/__main__.py	421	# test_moe_layer_performance(case, debug_mode=True)	COMMENT
LOW	…unctional_tests/shell_test_utils/run_batch_ci_tests.sh	1	#!/bin/bash	COMMENT
LOW	docs/autodoc2_docstrings_parser.py	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	docs/conf.py	1	# Copyright (c) 2025-2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	examples/megatron_fsdp/sbatch_mfsdp_deepseek_v3.sh	201	# Note: Update SBATCH parameters below according to your cluster configuration	COMMENT
LOW	examples/megatron_fsdp/train_llama3_8b_fsdp_h100_fp8.sh	121	--ckpt-format fsdp_dtensor	COMMENT
LOW	examples/megatron_fsdp/sbatch_checkpoint_convert.sh	21	SLURM_LOGS="${OUTPUT_PATH}/slurm_logs"	COMMENT
LOW	examples/mimo/train.py	121	# iterator exhausted on all ranks	COMMENT
LOW	examples/inference/run_inference_server.sh	1	#!/bin/bash	COMMENT
LOW	examples/inference/run_offline_inference.sh	1	#!/bin/bash	COMMENT
LOW	.github/workflows/sync-team-usergroups.yml	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/cicd-approve-test-queue.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/release-docs.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION.	COMMENT
LOW	.github/workflows/community-bot.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/nightly-sync-main-to-dev.yml	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/release.yaml	1	# Copyright (c) 2019-2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/_claude-fix-attempt.yml	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/_claude-fix-attempt.yml	21	#	COMMENT
LOW	.github/workflows/_claude-fix-attempt.yml	41	# branch. If an earlier attempt already created that commit, fixed code verifies	COMMENT
LOW	.github/workflows/community-request-assignee.yml	1	# Copyright (c) 2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.	COMMENT
LOW	.github/workflows/claude-fix.yml	1	# Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/claude-fix.yml	21	#	COMMENT
LOW	.github/workflows/claude-fix.yml	41	# publishes it from a fresh runner, ensures CI exists for the exact SHA, and	COMMENT
LOW	.github/workflows/close-inactive-issue-pr.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/oncall-assign.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/oncall-rotation.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/cherry-pick-release-commit.yml	1	# Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved.	COMMENT
LOW	.github/workflows/release-nightly-docs.yml	1	# Copyright (c) 2026, NVIDIA CORPORATION.	COMMENT
82 more matches not shown…

AI Structural Patterns145 hits · 126 pts

Severity	File	Line	Context
LOW	tools/bert_embedding/huggingface.py	51	CODE
LOW	tests/unit_tests/test_layer_wise_optimizer.py	149	CODE
LOW	…s/checkpoint/test_gpt_hybrid_conversion_parallelism.py	69	CODE
LOW	tests/unit_tests/distributed/mfsdp_v1/utils.py	39	CODE
LOW	…/unit_tests/transformer/test_multi_token_prediction.py	227	CODE
LOW	…er/experimental_attention_variant/test_absorbed_mla.py	83	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	1092	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	1239	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	1594	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	2582	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	2672	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	2759	CODE
LOW	…perimental_attention_variant/test_dsa_native_parity.py	2910	CODE
LOW	…ts/unit_tests/transformer/moe/test_token_dispatcher.py	37	CODE
LOW	tests/unit_tests/transformer/moe/test_paged_stashing.py	59	CODE
LOW	…s/elastification/test_inject_flextron_forward_logic.py	33	CODE
LOW	…it_tests/pipeline_parallel/test_bridge_communicator.py	163	CODE
LOW	tests/unit_tests/dist_checkpointing/utils.py	294	CODE
LOW	tests/unit_tests/resharding/test_planner.py	26	CODE
LOW	tests/unit_tests/inference/test_wandb_logging.py	46	CODE
LOW	tests/unit_tests/inference/coordinator_test_utils.py	16	CODE
LOW	…ests/inference/contexts/test_dynamic_prefix_caching.py	50	CODE
LOW	…eration_controllers/test_text_generation_controller.py	58	CODE
LOW	…eration_controllers/test_text_generation_controller.py	1971	CODE
LOW	…sts/unit_tests/inference/engines/test_static_engine.py	38	CODE
LOW	…sts/unit_tests/inference/engines/test_static_engine.py	322	CODE
LOW	tests/unit_tests/data/test_bin_reader.py	97	CODE
LOW	tests/test_utils/python_scripts/recipe_parser.py	363	CODE
LOW	…/academic_paper_scripts/detxoify_lm/perspective_api.py	88	CODE
LOW	examples/post_training/modelopt/utils.py	54	CODE
LOW	examples/post_training/modelopt/utils.py	132	CODE
LOW	examples/mimo/training/builder.py	114	CODE
LOW	examples/mimo/data/mock.py	165	CODE
LOW	examples/mimo/data/mock.py	41	CODE
LOW	examples/multimodal/model.py	18	CODE
LOW	…ples/multimodal/model_converter/vision_model_tester.py	73	CODE
LOW	megatron/core/parallel_state.py	547	CODE
LOW	megatron/core/ssm/mamba_mixer.py	161	CODE
LOW	megatron/core/ssm/mamba_mixer.py	771	CODE
LOW	megatron/core/ssm/mamba_context_parallel.py	329	CODE
LOW	megatron/core/ssm/mamba_context_parallel.py	356	CODE
LOW	megatron/core/ssm/mamba_context_parallel.py	199	CODE
LOW	megatron/core/ssm/gated_delta_net.py	77	CODE
LOW	megatron/core/ssm/ops/ssd_combined.py	23	CODE
LOW	megatron/core/ssm/ops/ssd_combined.py	163	CODE
LOW	…ore/tokenizers/text/libraries/huggingface_tokenizer.py	26	CODE
LOW	…n/core/tokenizers/text/libraries/tiktoken_tokenizer.py	270	CODE
LOW	megatron/core/fusions/fused_softmax.py	176	CODE
LOW	megatron/core/tensor_parallel/layers.py	827	CODE
LOW	megatron/core/tensor_parallel/mappings.py	57	CODE
LOW	megatron/core/tensor_parallel/mappings.py	81	CODE
LOW	megatron/core/tensor_parallel/mappings.py	100	CODE
LOW	megatron/core/tensor_parallel/mappings.py	115	CODE
LOW	megatron/core/tensor_parallel/mappings.py	592	CODE
LOW	megatron/core/tensor_parallel/mappings.py	621	CODE
LOW	megatron/core/post_training/modelopt/layers.py	108	CODE
LOW	megatron/core/optimizer/emerging_optimizers.py	163	CODE
LOW	megatron/core/optimizer/emerging_optimizers.py	325	CODE
LOW	megatron/core/optimizer/__init__.py	451	CODE
LOW	megatron/core/optimizer/distrib_optimizer.py	238	CODE
85 more matches not shown…

Verbosity Indicators49 hits · 90 pts

Severity	File	Line	Snippet	Context
LOW⚡	tools/checkpoint/saver_hf_llava.py	17	# Step 1: Reshape back to (num_head, 3*head_dim, -1)	COMMENT
LOW⚡	tools/checkpoint/saver_hf_llava.py	20	# Step 2: Slice along the head_dim dimension to get q, k, v	COMMENT
LOW⚡	tools/checkpoint/saver_hf_llava.py	25	# Step 3: Reshape each back to (num_head * head_dim, -1)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	361	# Step 1: Get vocabulary files	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	387	# Step 2: Get raw text data	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	411	# Step 3: Preprocess for GPT (GPT2BPETokenizer)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	434	# Step 4: Preprocess for BERT (BertWordPieceLowerCase + split-sentences)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	475	# Step 5: Preprocess for T5 (BertWordPieceCase)	COMMENT
LOW⚡	…s/common_pile_dataset/create_common_pile_ci_dataset.py	497	# Step 6: Clean up and verify	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2038	# Step 1: Forward pass for all 3 requests	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2047	# Step 2: Forward pass where req 10 finishes, req 11 continues. Req 999 is NOT scheduled.	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2136	# Step 1: All 3 requests are active, process forward pass	COMMENT
LOW⚡	…/unit_tests/inference/contexts/test_dynamic_context.py	2145	# Step 2: Both decode requests finish, chunked prefill NOT scheduled this step.	COMMENT
LOW	…/unit_tests/inference/contexts/test_dynamic_context.py	2069	# Step 3: Add the next chunk. It should sit exactly at the boundary (index 1) and inherit the state.	COMMENT
LOW⚡	…ts/unit_tests/inference/engines/test_dynamic_engine.py	3657	# Step 1: prefill for request 0 — should NOT count as a spec step.	COMMENT
LOW⚡	…ts/unit_tests/inference/engines/test_dynamic_engine.py	3667	# Step 2: decode for request 0 — should count spec tokens.	COMMENT
LOW	…ts/unit_tests/inference/engines/test_dynamic_engine.py	2753	# Step 1: Prefill. Processes the 4 prompt tokens.	COMMENT
LOW⚡	…l_tests/python_test_utils/compute_golden_statistics.py	11	# Step 1: Run batch tests (from megatron-rl directory):	STRING
LOW⚡	…l_tests/python_test_utils/compute_golden_statistics.py	15	# Step 2: Wait for jobs to complete, then compute statistics:	STRING
LOW⚡	.github/scripts/oncall_manager.py	135	# Step 1: Add new oncall first (include current members to avoid removing anyone yet)	COMMENT
LOW⚡	.github/scripts/oncall_manager.py	142	# Step 2: Now set the usergroup to contain only the new oncall	COMMENT
LOW⚡	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1871	# Step 3: Split parameter groups to meet communication segmentation requirements.	COMMENT
LOW⚡	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1916	# Step 4: Generate the groups of collective buckets, where each group aggregates	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1751	# Step 0: Register new FSDP unit modules.	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1784	# Step 1: Group the parameters according to their execution order and attributes.	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1822	# Step 2: Bucket the parameters based on the guide bucket size.	COMMENT
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	4468	# If use double buffer, we need to check if the next bucket	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	265	# Step 1: Find maximum value in the row for numerical stability	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	277	# Step 2: Compute sum of exp(x - max_val)	COMMENT
LOW	…e/transformer/custom_layers/batch_invariant_kernels.py	293	# Step 3: Compute final log_softmax values: x - max_val - log_sum_exp	COMMENT
LOW⚡	megatron/core/pipeline_parallel/bridge_communicator.py	437	# Step 1: broadcast its shape so receivers can allocate	COMMENT
LOW⚡	megatron/core/pipeline_parallel/bridge_communicator.py	443	# Step 2: broadcast the actual tensor	COMMENT
LOW	…n/core/dist_checkpointing/strategies/fully_parallel.py	238	# Step 3: load part of the checkpoint.	COMMENT
LOW	…n/core/dist_checkpointing/strategies/fully_parallel.py	264	# Step 4: exchange data between ranks	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	232	# Step 1: Segment tasks (break large tasks into chunks)	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	242	# Step 2: Pack tasks into workload groups	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	249	# Step 3: Schedule workloads to iterations	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	258	# Step 4: Prepare iteration schedules	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	264	# Step 5: Build GPU execution plans	COMMENT
LOW⚡	…gatron/core/resharding/nvshmem_copy_service/service.py	273	# Step 6: Create double-buffered events	COMMENT
LOW	…harding/nvshmem_copy_service/core/pipeline_executor.py	153	# Step 1: Pack NEXT iteration (async)	COMMENT
LOW	…harding/nvshmem_copy_service/core/pipeline_executor.py	165	# Step 2: Unpack PRIOR iteration (async)	COMMENT
LOW	…harding/nvshmem_copy_service/core/pipeline_executor.py	180	# Step 3: Send CURRENT iteration	COMMENT
LOW	…harding/nvshmem_copy_service/core/pipeline_executor.py	223	# Step 5: Wait for async pack to complete (double-buffer safety)	COMMENT
LOW⚡	…shmem_copy_service/planning/communication_scheduler.py	39	# Step 1: Collect all batches across all PE pairs	COMMENT
LOW⚡	…shmem_copy_service/planning/communication_scheduler.py	44	# Step 2: Assign batches to iterations using greedy conflict-free algorithm	COMMENT
LOW⚡	…shmem_copy_service/planning/communication_scheduler.py	49	# Step 3: Exchange detailed workload summaries (Task IDs/Sizes)	COMMENT
LOW⚡	…shmem_copy_service/planning/communication_scheduler.py	55	# Step 4: Build schedule map for this PE	COMMENT
LOW	…e/communication/torch_symm_triton/fused_collectives.py	151	# Step 1: - reduce-scatter + residual add for this token + collect sq sum	COMMENT

AI Slop Vocabulary35 hits · 80 pts

Severity	File	Line	Snippet	Context
LOW	tasks/finetune_utils.py	120	# shuffling so we can just use a simple infinite loop.	COMMENT
MEDIUM	tests/unit_tests/test_inference.py	107	# we are replicating what lm-eval-harness::TemplateLM::_encode_pair does	COMMENT
MEDIUM	…s/checkpoint/test_gpt_hybrid_conversion_parallelism.py	43	# harness launched pytest. When that default PG is multi-rank (e.g. Megatron's	COMMENT
MEDIUM	…ests/determinism/correctness/test_transformer_layer.py	210	# stress harness relies on the connection count the launcher	COMMENT
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	236	# DP: Only relevant when using HSDP, where we need the flattened DP group for data parallelism. (Otherwise, just pas	COMMENT
LOW	…t_tests/distributed/mfsdp_v1/test_mfsdp_fully_shard.py	238	# DP-Shard-CP: Only required if using CP. Otherwise, just pass dp_shard to FSDP.	COMMENT
MEDIUM	tests/unit_tests/transformer/test_utils.py	331	"""Test a comprehensive scenario with multiple configurations."""	STRING
MEDIUM⚡	tests/unit_tests/training/models/test_base.py	171	"""from_dict() reconstructs configs from serialized dicts, handles nested dataclasses, and is robust to unknown keys	STRING
LOW	…s/unit_tests/dist_checkpointing/models/test_mlp_glu.py	129	# Load happens in-place, so we can just use the same tensors	COMMENT
MEDIUM	…/unit_tests/inference/contexts/test_dynamic_context.py	2534	# 6. Verify seamless append (no legacy offset math needed)	COMMENT
MEDIUM	…cipes/h100/gpt-dynamic-inference-with-coordinator.yaml	92	# skills/run-performance-tests/SKILL.md for the harness it runs under.	COMMENT
MEDIUM	…inference_server_smoke_tp1_pp1_dp8_583m/serve_smoke.py	43	# the JET harness expects at ``logs///attempt_0//std.log``) while still	COMMENT
LOW	…sts/functional_tests/shell_test_utils/_run_training.sh	85	# If value is "true", just use the key	COMMENT
MEDIUM⚡	.github/workflows/claude_review.yml	105	# Strict review: comprehensive Megatron-LM focused analysis	COMMENT
LOW	megatron/core/optimizer_param_scheduler.py	235	# If the learning rate is constant, just return the initial value.	COMMENT
LOW	megatron/core/timers.py	476	# polutes the runs list, so we just add each as a scalar	COMMENT
MEDIUM	megatron/core/ssm/triton_cache_manager.py	69	# use temp dir to be robust against program interruptions	COMMENT
LOW	…ore/tokenizers/text/parsers/qwen3_coder_tool_parser.py	16	# These map to vLLM types but we just use dictionaries for now	COMMENT
LOW	megatron/core/tensor_parallel/random.py	108	# if not using cuda graphs, just use the builtin pytorch function	COMMENT
LOW⚡	megatron/core/tensor_parallel/random.py	189	# already graphable, just return it.	COMMENT
LOW⚡	megatron/core/tensor_parallel/random.py	195	# already non-graphable, just return it.	COMMENT
MEDIUM⚡	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1926	# Set aggregate buckets by FSDP units, i.e. buckets pertaining to the same	COMMENT
MEDIUM	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	2557	# to leverage NCCL UBR for high-precision gradient reduction with	COMMENT
MEDIUM	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	3516	# for a seamless user experience and coverage for ZeRO-1 and ZeRO-2?	COMMENT
MEDIUM	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	4737	# TODO(@cspades): Clean up this logic in conjunction with	COMMENT
LOW	…atron/core/distributed/fsdp/src/megatron_fsdp/utils.py	199	# if not using cuda graphs, just use the builtin pytorch function	COMMENT
LOW	megatron/core/transformer/transformer_config.py	2438	# so just set both if either is specified.	COMMENT
LOW	megatron/core/transformer/transformer_layer.py	906	# elements in bias_chunks are the same for all chunks, so we can just use the first one	COMMENT
MEDIUM	megatron/core/transformer/mlp.py	207	# Weight resharding across TP sizes will have aforementioned problems.	COMMENT
MEDIUM	megatron/core/datasets/indexed_dataset.py	6	# Essentially re-written in entirety	COMMENT
MEDIUM	…e/models/common/embeddings/language_model_embedding.py	145	# the original tensor from being garbage collected. Clone to facilitate GC.	COMMENT
LOW	megatron/core/models/bert/bert_model.py	193	# For local layer spec we just use b1ss	COMMENT
LOW	megatron/core/extensions/transformer_engine.py	689	# TODO should we ditch normalization config and just use spec to choose LayerNorm vs RMSNorm?	COMMENT
LOW	…tron/core/dist_checkpointing/strategies/async_utils.py	479	# to simply call `sync_all_async_calls` to check if other ranks complete the writing	COMMENT
MEDIUM	…/core/inference/data_parallel_inference_coordinator.py	386	# Todo [Siddharth]: Make this more robust to handle invalid messages.	COMMENT

Cross-Language Confusion14 hits · 55 pts

Severity	File	Line	Snippet	Context
HIGH	…core/transformer/heterogeneous/heterogeneous_config.py	177	"n_heads_in_group": null,	STRING
HIGH	…er/experimental_attention_variant/dsa_cudnn_kernels.py	1695	"""KL(target \|\| predict) over selected top-K entries.	STRING
HIGH	…er/experimental_attention_variant/dsa_cudnn_kernels.py	1747	"""KL(target \|\| predict) over the full KV axis, averaged over ``(B, S_q)``.	STRING
HIGH⚡	…mer/experimental_attention_variant/dsa_indexer_loss.py	25	"""Return elementwise ``KL(target \|\| predict)`` contributions."""	STRING
HIGH⚡	…mer/experimental_attention_variant/dsa_indexer_loss.py	35	"""Return ``KL(target \|\| predict)`` reduced over the key dimension."""	STRING
HIGH⚡	…mer/experimental_attention_variant/dsa_indexer_loss.py	43	"""Return ``KL(target \|\| predict)`` reduced over every dimension."""	STRING
HIGH	…e/pipeline_parallel/fine_grained_activation_offload.py	547	self.push(chunk)	CODE
HIGH	…e/pipeline_parallel/fine_grained_activation_offload.py	742	self.push(cur_chunk)	CODE
HIGH⚡	megatron/core/inference/inference_request.py	94	Each block hash is computed as SHA-256(parent_digest \|\| block_bytes), where	STRING
HIGH	megatron/core/inference/unified_memory.py	113	if (device != prev_device && device >= 0) cudaSetDevice(device);	CODE
HIGH	megatron/core/inference/unified_memory.py	149	if (device != prev_device && prev_device >= 0) cudaSetDevice(prev_device);	CODE
HIGH	megatron/core/inference/contexts/dynamic_context.py	4052	# : [ XX \| XX \| 16 XX \| 12 72 24 88 XX ] (XX = undefined)	COMMENT
HIGH	megatron/core/inference/moe/vllm_fused_moe.py	518	`input` are undefined).	STRING
HIGH	megatron/training/arguments.py	3287	"n_heads_in_group": null,	STRING

Fake / Example Data17 hits · 19 pts

Severity	File	Line	Snippet	Context
LOW⚡	tests/unit_tests/transformer/test_full_cuda_graph.py	35	dummy_data = torch.ones(1, 4)	CODE
LOW⚡	tests/unit_tests/transformer/test_full_cuda_graph.py	40	return model(dummy_data), loss_func	CODE
LOW⚡	tests/unit_tests/pipeline_parallel/test_schedules.py	528	dummy_data = torch.ones(1, 4)	CODE
LOW⚡	tests/unit_tests/pipeline_parallel/test_schedules.py	533	return model(dummy_data), loss_func	CODE
LOW	…sts/unit_tests/dist_checkpointing/test_safe_globals.py	28	dummy_obj = Namespace(dummy_value=0)	CODE
LOW	examples/multimodal/text_generation_mistral_clip.sh	7	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	examples/multimodal/text_generation_mistral_clip.sh	8	GROUNDTRUTH_PATH="placeholder"	CODE
LOW	…imodal/nvlm/run_text_generation_yi_34b_internvit_6b.sh	7	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	…imodal/nvlm/run_text_generation_yi_34b_internvit_6b.sh	8	GROUNDTRUTH_PATH="placeholder"	CODE
LOW	…ultimodal/nvlm/run_text_generation_qwen25_7b_siglip.sh	8	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	…ultimodal/nvlm/run_text_generation_qwen25_7b_siglip.sh	9	GROUNDTRUTH_PATH="placeholder"	CODE
LOW	…al/nvlm/run_text_generation_qwen20_72b_internvit_6b.sh	8	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	…al/nvlm/run_text_generation_qwen20_72b_internvit_6b.sh	9	GROUNDTRUTH_PATH="placeholder"	CODE
LOW	…/nvlm/run_text_generation_qwen25_7b_internvit_video.sh	8	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	…/nvlm/run_text_generation_qwen25_7b_internvit_video.sh	9	GROUNDTRUTH_PATH="placeholder"	CODE
LOW	…al/llama_3p1_nemotron_nano_vl_8b_v1/text_generation.sh	7	INPUT_IMAGE_PATH="placeholder"	CODE
LOW	…al/llama_3p1_nemotron_nano_vl_8b_v1/text_generation.sh	8	GROUNDTRUTH_PATH="placeholder"	CODE

Modern AI Meta-Vocabulary4 hits · 12 pts

Severity	File	Line	Snippet	Context
MEDIUM	tests/functional_tests/shell_test_utils/run_ci_test.sh	360	# orchestration's resumable retrigger.	COMMENT
MEDIUM	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	1837	# We may share the embedding model weight and the final output layer,	COMMENT
MEDIUM	megatron/core/inference/apis/async_llm.py	214	# SIGINT or out-of-band orchestration).	COMMENT
MEDIUM	…t_generation_controllers/text_generation_controller.py	2327	# Pick the context window that we need to pass through the network.	COMMENT

Dead Code5 hits · 10 pts

Severity	File	Line	Context
MEDIUM	tools/checkpoint/loader_llava.py	52	CODE
MEDIUM	…nce_tp1_pp1_583m_cuda_graphs_validation/cuda_graphs.py	183	CODE
MEDIUM	…nce_tp1_pp1_583m_cuda_graphs_validation/cuda_graphs.py	184	CODE
MEDIUM	…nce_tp1_pp1_583m_cuda_graphs_validation/cuda_graphs.py	185	CODE
MEDIUM	…nce_tp1_pp1_583m_cuda_graphs_validation/cuda_graphs.py	186	CODE

Synthetic Comment Markers1 hit · 5 pts

Severity	File	Line	Snippet	Context
HIGH	.github/workflows/claude-copy-to-main.yml	118	--body "🤖 This PR was auto-generated by Claude via the Claude copy workflow.\n\nCherry-picked from	CODE

Example Usage Blocks3 hits · 4 pts

Severity	File	Line	Snippet	Context
LOW	tools/bisect.sh	5	# Usage:	COMMENT
LOW	tools/common_pile_dataset/setup_common_pile_dataset.sh	9	# Usage:	COMMENT
LOW	…unctional_tests/shell_test_utils/run_batch_ci_tests.sh	5	# Usage:	COMMENT

TODO Padding3 hits · 3 pts

Severity	File	Line	Snippet	Context
LOW	…/inference/test_data_parallel_inference_coordinator.py	122	self.ep_consensus_interval = 1 # TODO: add tests for interval > 1 (e.g. verify pausing is delayed by at most N	CODE
LOW	…ibuted/fsdp/src/megatron_fsdp/param_and_grad_buffer.py	5081	# TODO: Add validation checks for the legality of DTensor.	COMMENT
LOW	megatron/core/transformer/multi_token_prediction.py	1417	# TODO: implement block-based recompute for MTP	COMMENT

Slop Phrases2 hits · 2 pts

Severity	File	Line	Snippet	Context
MEDIUM	megatron/core/process_groups_config.py	660	"""Check if this rank has a language model.	STRING
MEDIUM	megatron/core/process_groups_config.py	663	True if this rank has a language model, False otherwise.	STRING

Overly Generic Function Names3 hits · 2 pts

Severity	File	Line	Snippet	Context
LOW	…s/unit_tests/training/config/test_instantiate_utils.py	52	def test_function(arg1=None, arg2=None, **kwargs):	CODE
LOW	tests/unit_tests/training/config/test_yaml_utils.py	45	def test_function():	CODE
LOW	megatron/core/utils.py	2713	def my_function():	STRING

Analysis Overview

What These Metrics Mean

Score History

Severity Breakdown

Directory Score Breakdown

Pattern Findings