microsoft/RD-Agent

24.3

Adjusted Score

24.3

Raw Score

100%

Time Factor

2026-06-15

Last Push

13.9K

Stars

Python

Language

113.4K

Lines of Code

790

Files

1.4K

Pattern Hits

2026-07-14

Scan Date

0.29

HC Hit Rate

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL 0HIGH 227MEDIUM 176LOW 1041

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.

Pattern Findings

The scanner identified 1444 distinct pattern matches across 23 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.

Cross-File Repetition212 hits · 1060 pts

Severity	File	Line	Snippet	Context
HIGH	rdagent/core/interactor.py	0	interact with the user to get feedback or confirmation. responsibilities: - present the current state of the experiment	STRING
HIGH	rdagent/scenarios/data_science/interactor/__init__.py	0	interact with the user to get feedback or confirmation. responsibilities: - present the current state of the experiment	STRING
HIGH	rdagent/components/interactor/__init__.py	0	interact with the user to get feedback or confirmation. responsibilities: - present the current state of the experiment	STRING
HIGH	rdagent/app/qlib_rd_loop/model.py	0	auto r&d evolving loop for fintech factors. you can continue running session by .. code-block:: python dotenv run -- pyt	STRING
HIGH	rdagent/app/qlib_rd_loop/factor.py	0	auto r&d evolving loop for fintech factors. you can continue running session by .. code-block:: python dotenv run -- pyt	STRING
HIGH	rdagent/app/qlib_rd_loop/quant.py	0	auto r&d evolving loop for fintech factors. you can continue running session by .. code-block:: python dotenv run -- pyt	STRING
HIGH	…ent/components/coder/data_science/ensemble/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	…ent/components/coder/data_science/pipeline/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	…ent/components/coder/data_science/workflow/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	rdagent/components/coder/data_science/model/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	…gent/components/coder/data_science/feature/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	…ponents/coder/data_science/raw_data_loader/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	rdagent/scenarios/data_science/dev/runner/__init__.py	0	assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not	STRING
HIGH	…ent/spaceship-titanic_template/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…mplates/playground-series-s4e8/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…/templates/meta_tpl_deprecated/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…lar-playground-series-dec-2021/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…york-city-taxi-fare-prediction/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…plates/playground-series-s3e26/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…/experiment/templates/sf-crime/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…s/forest-cover-type-prediction/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…nt/templates/spaceship-titanic/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…ent/templates/digit-recognizer/fea_share_preprocess.py	0	this method loads the data, drops the unnecessary columns, and splits it into train and validation sets.	STRING
HIGH	…ent/spaceship-titanic_template/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…mplates/playground-series-s4e8/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…/templates/meta_tpl_deprecated/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…plates/playground-series-s3e26/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…/experiment/templates/sf-crime/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…nt/templates/spaceship-titanic/fea_share_preprocess.py	0	fits the preprocessor on the training data and returns the fitted preprocessor.	STRING
HIGH	…ent/spaceship-titanic_template/fea_share_preprocess.py	0	transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across	STRING
HIGH	…/templates/meta_tpl_deprecated/fea_share_preprocess.py	0	transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across	STRING
HIGH	…nt/templates/spaceship-titanic/fea_share_preprocess.py	0	transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across	STRING
HIGH	…ent/spaceship-titanic_template/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…mplates/playground-series-s4e8/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…plates/playground-series-s3e14/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…/templates/meta_tpl_deprecated/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…ventilator-pressure-prediction/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…lar-playground-series-dec-2021/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…york-city-taxi-fare-prediction/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…plates/playground-series-s3e11/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…plates/playground-series-s3e16/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…plates/playground-series-s3e26/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…rize-english-language-learning/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…/experiment/templates/sf-crime/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…s/forest-cover-type-prediction/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…lar-playground-series-may-2022/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…nt/templates/spaceship-titanic/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…l-iceberg-classifier-challenge/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…ent/templates/digit-recognizer/fea_share_preprocess.py	0	this method applies the preprocessing steps to the training, validation, and test datasets.	STRING
HIGH	…paceship-titanic_template/model/select_randomforest.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…periment/spaceship-titanic_template/model/select_nn.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…nt/spaceship-titanic_template/model/select_lightgbm.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…ent/spaceship-titanic_template/model/select_xgboost.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…es/playground-series-s4e9/model/select_randomforest.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…nt/templates/playground-series-s4e9/model/select_nn.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…plates/playground-series-s4e9/model/select_lightgbm.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…mplates/playground-series-s4e9/model/select_xgboost.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…es/playground-series-s4e8/model/select_randomforest.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…nt/templates/playground-series-s4e8/model/select_nn.py	0	select relevant features. to be used in fit & predict function.	STRING
HIGH	…plates/playground-series-s4e8/model/select_lightgbm.py	0	select relevant features. to be used in fit & predict function.	STRING
152 more matches not shown…

Unused Imports310 hits · 306 pts

Severity	File	Line	Context
LOW	test/utils/test_conf.py	4	CODE
LOW	test/utils/test_env.py	1	CODE
LOW	test/utils/test_import.py	2	CODE
LOW	test/utils/env_tpl/read_exp.py	2	CODE
LOW	test/utils/env_tpl/read_exp.py	3	CODE
LOW	test/finetune/test_benchmark_tablebench.py	6	CODE
LOW	test/finetune/test_benchmark.py	10	CODE
LOW	test/finetune/test_benchmark_api.py	10	CODE
LOW	test/oai/test_advanced.py	7	CODE
LOW	test/oai/test_base.py	1	CODE
LOW	test/oai/test_completion.py	3	CODE
LOW	test/notebook/testfiles/main2.py	5	CODE
LOW	test/notebook/testfiles/main2.py	14	CODE
LOW	test/notebook/testfiles/main2.py	16	CODE
LOW	rdagent/core/evolving_agent.py	1	CODE
LOW	rdagent/core/conf.py	1	CODE
LOW	rdagent/core/evolving_framework.py	1	CODE
LOW	rdagent/core/proposal.py	3	CODE
LOW	rdagent/core/experiment.py	1	CODE
LOW	rdagent/core/utils.py	1	CODE
LOW	rdagent/core/developer.py	1	CODE
LOW	rdagent/core/interactor.py	1	CODE
LOW	rdagent/app/cli.py	9	CODE
LOW	rdagent/app/benchmark/factor/analysis.py	9	CODE
LOW	rdagent/app/benchmark/model/eval.py	14	CODE
LOW	rdagent/app/CI/run.py	1	CODE
LOW	rdagent/app/utils/ws.py	1	CODE
LOW	rdagent/app/utils/health_check.py	6	CODE
LOW	rdagent/app/utils/info.py	4	CODE
LOW	rdagent/app/utils/info.py	9	CODE
LOW	rdagent/app/utils/ws_ft.py	1	CODE
LOW	rdagent/app/utils/ws_ft.py	5	CODE
LOW	rdagent/app/utils/ws_ft.py	7	CODE
LOW	rdagent/app/finetune/data_science/scen.py	4	CODE
LOW	rdagent/app/finetune/data_science/scen.py	5	CODE
LOW	rdagent/app/finetune/data_science/scen.py	8	CODE
LOW	rdagent/app/finetune/data_science/loop.py	8	CODE
LOW	rdagent/app/finetune/data_science/loop.py	9	CODE
LOW	rdagent/app/qlib_rd_loop/factor_from_report.py	4	CODE
LOW	rdagent/app/qlib_rd_loop/factor_from_report.py	15	CODE
LOW	rdagent/app/rl/ui/data_loader.py	17	CODE
LOW	rdagent/utils/env.py	14	CODE
LOW	rdagent/utils/env.py	45	CODE
LOW	rdagent/utils/__init__.py	21	CODE
LOW	rdagent/utils/agent/apply_patch.py	9	CODE
LOW	rdagent/utils/agent/__init__.py	1	CODE
LOW	rdagent/utils/agent/workflow.py	2	CODE
LOW	rdagent/utils/agent/workflow.py	2	CODE
LOW	rdagent/utils/workflow/__init__.py	1	CODE
LOW	rdagent/utils/workflow/__init__.py	1	CODE
LOW	rdagent/utils/workflow/__init__.py	2	CODE
LOW	rdagent/utils/workflow/__init__.py	3	CODE
LOW	rdagent/utils/workflow/loop.py	21	CODE
LOW	rdagent/utils/workflow/tracking.py	18	CODE
LOW	rdagent/utils/repo/repo_utils.py	2	CODE
LOW	rdagent/oai/llm_utils.py	1	CODE
LOW	rdagent/oai/llm_utils.py	10	CODE
LOW	rdagent/oai/llm_conf.py	1	CODE
LOW	rdagent/oai/backend/deprec.py	2	CODE
LOW	rdagent/oai/backend/deprec.py	4	CODE
250 more matches not shown…

Excessive Try-Catch Wrapping193 hits · 242 pts

Severity	File	Line	Snippet	Context
LOW	test/utils/test_import.py	56	except Exception as e:	CODE
MEDIUM	test/finetune/test_benchmark_tablebench.py	122	print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}")	CODE
MEDIUM	test/finetune/test_benchmark.py	125	print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}")	CODE
MEDIUM	test/finetune/test_benchmark_api.py	296	print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}")	CODE
LOW	test/oai/test_llm_connectivity.py	42	except Exception as e:	CODE
LOW	test/notebook/test_util.py	1475	except Exception as e:	CODE
LOW	test/notebook/test_util.py	1483	except Exception as e:	CODE
MEDIUM⚡	test/notebook/testfiles/main2.py	56	print(f"Error loading train.csv: {e}")	CODE
MEDIUM⚡	test/notebook/testfiles/main2.py	62	print(f"Error listing train dir: {e}")	CODE
MEDIUM⚡	test/notebook/testfiles/main2.py	68	print(f"Error listing test dir: {e}")	CODE
MEDIUM⚡	test/notebook/testfiles/main2.py	79	print(f"Error reading sample_submission.csv: {e}")	CODE
MEDIUM⚡	test/notebook/testfiles/main2.py	127	print(f"Error reading {filepath}: {e}")	CODE
MEDIUM	test/notebook/testfiles/main2.py	372	print("Error computing log_loss on val:", ex)	CODE
MEDIUM	test/notebook/testfiles/main2.py	420	print("Error computing log_loss on validation:", ex)	CODE
MEDIUM	test/notebook/testfiles/main2.py	117	def load_img_as_numpy_with_mask(filepath):	CODE
LOW⚡	test/notebook/testfiles/main2.py	55	except Exception as e:	CODE
LOW⚡	test/notebook/testfiles/main2.py	61	except Exception as e:	CODE
LOW⚡	test/notebook/testfiles/main2.py	67	except Exception as e:	CODE
LOW⚡	test/notebook/testfiles/main2.py	78	except Exception as e:	CODE
LOW⚡	test/notebook/testfiles/main2.py	126	except Exception as e:	CODE
LOW	test/notebook/testfiles/main2.py	198	except Exception as e:	CODE
LOW	test/notebook/testfiles/main2.py	370	except Exception as ex:	CODE
LOW	test/notebook/testfiles/main2.py	418	except Exception as ex:	CODE
LOW	test/notebook/testfiles/main2.py	446	except Exception:	CODE
LOW	test/notebook/testfiles/main_missing_sections.py	303	except Exception as e:	CODE
LOW	test/notebook/testfiles/main_missing_sections.py	311	except Exception as e:	CODE
MEDIUM	test/notebook/testfiles/main_missing_sections.py	300	def main():	CODE
LOW	test/notebook/testfiles/main_missing_main_fn.py	303	except Exception as e:	CODE
LOW	test/notebook/testfiles/main_missing_main_fn.py	311	except Exception as e:	CODE
LOW	test/notebook/testfiles/main.py	305	except Exception as e:	CODE
LOW	test/notebook/testfiles/main.py	313	except Exception as e:	CODE
LOW	rdagent/app/kaggle/loop.py	86	except Exception as e:	CODE
LOW	rdagent/app/kaggle/loop.py	106	except Exception as e:	CODE
LOW	rdagent/app/utils/health_check.py	73	except Exception as e:	CODE
LOW	rdagent/app/utils/health_check.py	89	except Exception as e:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	55	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	96	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	139	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	181	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	452	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	485	except Exception:	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	225	except Exception:	CODE
LOW	rdagent/app/rl/ui/rl_summary.py	45	except Exception:	CODE
MEDIUM	rdagent/utils/env.py	948	def prepare(self) -> None:	CODE
MEDIUM	rdagent/utils/env.py	994	def prepare(self) -> None:	CODE
LOW	rdagent/utils/env.py	110	except Exception as cleanup_error:	CODE
LOW	rdagent/utils/env.py	346	except Exception as e:	CODE
LOW	rdagent/utils/env.py	863	except Exception as e:	CODE
LOW	rdagent/utils/env.py	887	except Exception as exc: # pragma: no cover - best-effort helper	CODE
LOW	rdagent/utils/env.py	971	except Exception as e:	CODE
LOW	rdagent/utils/env.py	1005	except Exception as e:	CODE
LOW	rdagent/utils/env.py	1405	except Exception:	CODE
LOW	rdagent/utils/__init__.py	83	except Exception as e:	CODE
LOW	rdagent/utils/__init__.py	172	except Exception as e:	CODE
LOW	rdagent/utils/__init__.py	181	except Exception as e:	CODE
LOW	rdagent/utils/agent/workflow.py	53	except Exception as e:	CODE
LOW	rdagent/utils/workflow/misc.py	40	except Exception as e:	CODE
MEDIUM	rdagent/utils/workflow/misc.py	41	print(f"Error: {e}")	CODE
LOW	rdagent/utils/workflow/loop.py	247	except Exception as e:	CODE
LOW	rdagent/utils/workflow/loop.py	556	except Exception as ex:	CODE
133 more matches not shown…

Hyper-Verbose Identifiers199 hits · 202 pts

Severity	File	Line	Snippet	Context
LOW	test/qlib/test_model_factor_proposal.py	42	def test_model_proposal_import():	CODE
LOW	test/qlib/test_model_factor_proposal.py	46	def test_factor_proposal_import():	CODE
LOW	test/utils/test_env.py	146	def test_cleanup_container_import(self):	CODE
LOW	test/utils/test_import.py	17	def import_all_modules_from_directory(directory):	CODE
LOW	test/utils/test_ws.py	27	def test_checkpoint_roundtrip(self) -> None:	CODE
LOW	test/utils/test_kaggle.py	12	def test_competition_template(self):	CODE
LOW⚡	test/oai/test_embedding_and_similarity.py	22	def test_embedding_similarity(self) -> None:	CODE
LOW⚡	test/oai/test_embedding_and_similarity.py	29	def test_embedding_long_text_truncation(self) -> None:	CODE
LOW	test/oai/test_advanced.py	24	def test_chat_cache_multiprocess(self) -> None:	CODE
LOW	test/oai/test_completion.py	29	def test_chat_completion_json_mode(self) -> None:	CODE
LOW	test/oai/test_completion.py	41	def test_build_messages_and_calculate_token(self) -> None:	CODE
LOW	test/oai/test_completion.py	48	def test_json_mode_with_specific_target_type(self) -> None:	CODE
LOW	test/oai/test_completion.py	68	def test_response_format_with_basemodel(self) -> None:	CODE
LOW	test/notebook/test_util.py	143	def test_happy_path_no_header(self):	CODE
LOW	test/notebook/test_util.py	268	def test_ignores_indented_calls(self):	CODE
LOW	test/notebook/test_util.py	356	def test_happy_path_no_header(self):	CODE
LOW	test/notebook/test_util.py	477	def test_ignore_unknown_section(self):	CODE
LOW	test/notebook/test_util.py	548	def test_happy_path_multiline(self):	CODE
LOW	test/notebook/test_util.py	600	def test_arbitrary_print_happy_path(self):	CODE
LOW	test/notebook/test_util.py	695	def test_happy_path_with_args(self):	CODE
LOW	test/notebook/test_util.py	703	def test_happy_path_with_args_multiline(self):	CODE
LOW	test/notebook/test_util.py	740	def test_function_does_not_exist(self):	CODE
LOW	test/notebook/test_util.py	807	def test_happy_path_arbitrary_content(self):	CODE
LOW	test/notebook/test_util.py	820	def test_block_does_not_exist(self):	CODE
LOW	test/notebook/test_notebook_converter.py	10	def normalize_nb_json_for_comparison(nb_json_str):	CODE
LOW	test/notebook/test_notebook_converter.py	30	def test_validation_missing_main_fn(self):	CODE
LOW	test/notebook/test_notebook_converter.py	39	def test_validation_missing_sections(self):	CODE
LOW	test/notebook/test_notebook_converter.py	96	def test_argparse_with_dupe_sys(self):	CODE
LOW⚡	test/notebook/testfiles/main2.py	117	def load_img_as_numpy_with_mask(filepath):	CODE
LOW	rdagent/core/conf.py	16	def settings_customise_sources(	CODE
LOW	rdagent/core/evolving_framework.py	148	def load_or_init_knowledge_base(	CODE
LOW	rdagent/core/evolving_framework.py	183	def load_dumped_knowledge_base(self, args: Any, *kwargs: Any) -> None:	CODE
LOW	rdagent/core/proposal.py	178	def get_sota_hypothesis_and_experiment(self) -> tuple[Hypothesis \| None, Experiment \| None]:	CODE
LOW	rdagent/core/experiment.py	209	def link_all_files_in_folder_to_workspace(data_path: Path, workspace_path: Path) -> None:	CODE
LOW	rdagent/core/experiment.py	275	def inject_code_from_file_dict(self, workspace: FBWorkspace) -> None:	CODE
LOW	rdagent/app/utils/health_check.py	37	def check_and_list_free_ports(start_port=19899, max_ports=10) -> None:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	457	def extract_baseline_full_benchmark(task_path: Path, split: str = "validation") -> dict \| None:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	490	def get_task_full_benchmark_df(task_path: Path, split: str) -> pd.DataFrame:	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	554	def render_task_detail_selector(job_path: Path) -> None:	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	17	def convert_latex_for_streamlit(text: str) -> str:	CODE
LOW	rdagent/app/finetune/data_science/scen.py	14	def _get_data_folder_description(self) -> str:	CODE
LOW	rdagent/app/general_model/general_model.py	14	def extract_models_and_implement(report_file_path: str) -> None:	CODE
LOW	rdagent/app/qlib_rd_loop/factor_from_report.py	61	def extract_hypothesis_and_exp_from_reports(report_file_path: str) -> QlibFactorExperiment \| None:	CODE
LOW	rdagent/utils/env.py	69	def extract_dir_name_from_path_config(path_str: str) -> str:	CODE
LOW	rdagent/utils/env.py	172	def get_workspace_content_for_hash(self, local_path: str \| Path) -> list[list[str]]:	CODE
LOW	rdagent/utils/env.py	275	def unzip_a_file_into_a_folder(	CODE
LOW	rdagent/utils/env.py	537	def dump_python_code_run_and_get_results(	CODE
LOW	rdagent/utils/env.py	811	def populate_exclude_chmod_paths(self) -> "DockerConf":	CODE
LOW	rdagent/utils/env.py	877	def _sync_conda_cache_with_real_envs() -> None:	CODE
LOW	rdagent/utils/env.py	1117	def get_workspace_content_for_hash(self, local_path: str \| Path) -> list[list[str]]:	CODE
LOW	rdagent/utils/__init__.py	28	def get_module_by_module_path(module_path: Union[str, ModuleType]) -> ModuleType:	CODE
LOW	rdagent/utils/__init__.py	193	def remove_path_info_from_str(base_path: Path, target_string: str) -> str:	CODE
LOW	rdagent/utils/agent/workflow.py	10	def build_cls_from_json_with_retry(	CODE
LOW	rdagent/utils/workflow/loop.py	172	def _check_exit_conditions_on_step(self, loop_id: Optional[int] = None, step_id: Optional[int] = None) -> None:	CODE
LOW	rdagent/oai/llm_utils.py	13	def calculate_embedding_distance_between_str_list(	CODE
LOW	rdagent/oai/backend/deprec.py	273	def _create_embedding_inner_function(self, input_content_list: list[str]) -> list[list[float]]:	CODE
LOW	rdagent/oai/backend/deprec.py	294	def _create_chat_completion_inner_function( # type: ignore[no-untyped-def] # noqa: C901, PLR0912, PLR0915	CODE
LOW	rdagent/oai/backend/deprec.py	467	def _calculate_token_from_messages(self, messages: list[dict[str, Any]]) -> int:	CODE
LOW	rdagent/oai/backend/litellm.py	60	def _calculate_token_from_messages(self, messages: list[dict[str, Any]]) -> int:	CODE
LOW	rdagent/oai/backend/litellm.py	71	def _create_embedding_inner_function(self, input_content_list: list[str]) -> list[list[float]]:	CODE
139 more matches not shown…

Decorative Section Separators66 hits · 197 pts

Severity	File	Line	Snippet	Context
MEDIUM	rdagent/utils/__init__.py	108	\d+/\d+\s+[━]+\s+\d+s?\s+\d+ms/step.*?\u0008+ \| # e.g. "10/100 ━━━━━━ 3s 50ms/step"	CODE
MEDIUM	rdagent/utils/__init__.py	109	\d+/\d+\s+[━]+\s+\d+s?\s+\d+ms/step \| # e.g. "10/100 ━━━━━━ 3s 50ms/step" (no backspaces)	CODE
MEDIUM	…arios/data_science/proposal/exp_gen/trace_scheduler.py	134	# ======================================================================================	COMMENT
MEDIUM	…arios/data_science/proposal/exp_gen/trace_scheduler.py	136	# ======================================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	38	# ==============================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	40	# ==============================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	516	# ==============================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	518	# ==============================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	634	# ==============================================================================	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	636	# ==============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/default.py	17	# ============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/default.py	19	# ============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/default.py	203	# ============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/default.py	205	# ============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	1	# =============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	3	# =============================================================================	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	18	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	20	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	163	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	165	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	223	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/finetune/proposal/prompts.yaml	225	# ═══════════════════════════════════════════════════════════════════════════	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/server.py	356	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/server.py	358	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	83	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	85	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	98	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	100	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	173	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	175	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	238	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	240	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	272	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	274	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	346	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	348	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	386	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/utils.py	388	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/evaluator.py	16	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/evaluator.py	18	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/evaluator.py	49	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/core/evaluator.py	51	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	35	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	37	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	228	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	230	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	313	# ============================================================	COMMENT
MEDIUM	rdagent/scenarios/rl/autorl_bench/test/test_fixes.py	315	# ============================================================	COMMENT
MEDIUM⚡	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	67	# ============================================================	COMMENT
MEDIUM⚡	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	69	# ============================================================	COMMENT
MEDIUM	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	121	# ============================================================	COMMENT
MEDIUM	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	123	# ============================================================	COMMENT
MEDIUM	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	212	# ============================================================	COMMENT
MEDIUM	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	214	# ============================================================	COMMENT
MEDIUM	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	56	# ============================================================	COMMENT
MEDIUM	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	58	# ============================================================	COMMENT
MEDIUM⚡	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	148	# ============================================================	COMMENT
MEDIUM⚡	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	150	# ============================================================	COMMENT
MEDIUM	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	261	# ============================================================	STRING
MEDIUM	…t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py	263	# ============================================================	STRING
6 more matches not shown…

Deep Nesting178 hits · 168 pts

Severity	File	Line	Context
LOW	test/notebook/testfiles/main2.py	37	CODE
LOW	rdagent/core/evolving_agent.py	140	CODE
LOW	rdagent/core/experiment.py	324	CODE
LOW	rdagent/core/experiment.py	350	CODE
LOW	rdagent/app/CI/run.py	182	CODE
LOW	rdagent/app/CI/run.py	432	CODE
LOW	rdagent/app/CI/run.py	185	CODE
LOW	rdagent/app/finetune/llm/ui/data_loader.py	98	CODE
LOW	rdagent/app/finetune/llm/ui/data_loader.py	365	CODE
LOW	rdagent/app/finetune/llm/ui/data_loader.py	423	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	27	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	73	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	101	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	224	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	429	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	457	CODE
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	490	CODE
LOW	rdagent/app/finetune/llm/ui/app.py	28	CODE
LOW	rdagent/app/finetune/llm/ui/app.py	66	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	42	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	58	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	208	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	313	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	461	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	602	CODE
LOW	rdagent/app/finetune/llm/ui/components.py	665	CODE
LOW	rdagent/app/finetune/llm/ui/benchmarks/bioprobench.py	21	CODE
LOW	rdagent/app/finetune/llm/ui/benchmarks/__init__.py	22	CODE
LOW	rdagent/app/finetune/llm/ui/benchmarks/chemcotbench.py	47	CODE
LOW	rdagent/app/qlib_rd_loop/factor_from_report.py	112	CODE
LOW	rdagent/app/rl/ui/data_loader.py	237	CODE
LOW	rdagent/app/rl/ui/data_loader.py	285	CODE
LOW	rdagent/app/rl/ui/rl_summary.py	74	CODE
LOW	rdagent/app/rl/ui/app.py	46	CODE
LOW	rdagent/app/rl/ui/app.py	75	CODE
LOW	rdagent/app/rl/ui/components.py	24	CODE
LOW	rdagent/app/rl/ui/components.py	38	CODE
LOW	rdagent/app/rl/ui/components.py	211	CODE
LOW	rdagent/utils/env.py	139	CODE
LOW	rdagent/utils/env.py	275	CODE
LOW	rdagent/utils/env.py	593	CODE
LOW	rdagent/utils/env.py	1162	CODE
LOW	rdagent/utils/env.py	1308	CODE
LOW	rdagent/utils/env.py	617	CODE
LOW	rdagent/utils/__init__.py	100	CODE
LOW	rdagent/utils/agent/apply_patch.py	275	CODE
LOW	rdagent/utils/agent/apply_patch.py	385	CODE
LOW	rdagent/utils/agent/apply_patch.py	457	CODE
LOW	rdagent/utils/agent/apply_patch.py	173	CODE
LOW	rdagent/utils/agent/tpl.py	33	CODE
LOW	rdagent/utils/workflow/loop.py	194	CODE
LOW	rdagent/utils/workflow/loop.py	313	CODE
LOW	rdagent/oai/backend/deprec.py	109	CODE
LOW	rdagent/oai/backend/deprec.py	294	CODE
LOW	rdagent/oai/backend/litellm.py	95	CODE
LOW	rdagent/oai/backend/litellm.py	127	CODE
LOW	rdagent/oai/backend/base.py	520	CODE
LOW	rdagent/components/benchmark/eval_method.py	200	CODE
LOW	rdagent/components/knowledge_management/graph.py	197	CODE
LOW	rdagent/components/workflow/rd_loop.py	75	CODE
118 more matches not shown…

Verbosity Indicators41 hits · 85 pts

Severity	File	Line	Snippet	Context
LOW⚡	rdagent/utils/env.py	955	# Step 1: Install base dependencies (torch, llamafactory, etc.)	COMMENT
LOW⚡	rdagent/utils/env.py	959	# Step 2: Install flash-attn (requires torch first, uses --no-build-isolation)	COMMENT
LOW⚡	rdagent/utils/agent/ret.py	97	# Step 1: extract patch by pattern	COMMENT
LOW⚡	rdagent/utils/agent/ret.py	103	# Step 2: apply the patch, this will modify the file in place	COMMENT
LOW	…nt/components/coder/data_science/pipeline/prompts.yaml	302	### Step 6: Similar Successful Implementations to help Code Improvement	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	70	# Step 1: Parameter filtering	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	73	# Step 2: Inject required parameters for multi-task environments	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	76	# Step 3: Micro-batch testing (validates everything at runtime)	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	87	# Step 1: Run complete sampling/filtering (fast, no LLM) - runs in BOTH modes	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	90	# Step 2: Limit LLM processing in debug mode only	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	96	# Step 3: Show the actual number of sampled items (Do not estimate; count the exact number of samples that will be p	COMMENT
LOW	rdagent/components/coder/finetune/eval.py	68	# Step 1: Check script exists	COMMENT
LOW	rdagent/components/coder/finetune/eval.py	82	# Step 3: Execute script in DEBUG mode (generates ~10 samples for fast validation)	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	101	# Step 4: Validate output	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	111	# Step 5: Load data if valid	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	120	# Step 6: Generate LLM feedback	COMMENT
LOW	…cenarios/data_science/proposal/exp_gen/prompts_v2.yaml	803	### Step 2: Workflow Update :	COMMENT
LOW⚡	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	401	# Step 2: Generate the rest of the hypothesis & task	COMMENT
LOW⚡	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1458	# Step 3: Select the best hypothesis	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	347	# Step 1: Generate component	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1367	# Step 1: Identify problems	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1391	# Step 2: Propose hypothesis based on the identified problems (and sampled ideas)	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	205	# Step 1: If we have fewer traces than our target, start a new one.	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	210	# Step 2: Probabilistically select a leaf to expand.	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	359	# Step 1: keep same policy to reach target number of parallel traces	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	364	# Step 2: consider only available leaves (not being expanded)	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	369	# Step 3: compute priors (P) from potentials via softmax	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	377	# Step 4: score each leaf using PUCT-like rule: Q + U	COMMENT
LOW	…arios/data_science/proposal/exp_gen/trace_scheduler.py	391	# # Step 5: optimistic visit update on selection; value update deferred to observe_feedback	COMMENT
LOW	…data_science/proposal/exp_gen/draft/prompts_draft.yaml	220	# Step 2: Workflow Update	COMMENT
LOW	…scenarios/data_science/proposal/exp_gen/draft/draft.py	246	# Step 0: Prepare	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	271	# Step 1: Retrieve Knowledge	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	274	# Step 2: Generate Hypothesis based on General Knowledge	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	282	# Step 3: Design Task	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	12	## Step 0: Pre-definition	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	17	## Step 1: Benchmark Metrics Evaluation (HIGHEST PRIORITY)	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	30	## Step 2: Code Quality Assessment	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	34	## Step 3: Final Decision (Acceptance as SOTA)	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	190	## Step 1: Error Classification	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	197	## Step 2: Root Cause Analysis	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	202	## Step 3: Actionable Suggestions	COMMENT

Structural Annotation Overuse40 hits · 83 pts

Severity	File	Line	Snippet	Context
LOW⚡	rdagent/utils/env.py	955	# Step 1: Install base dependencies (torch, llamafactory, etc.)	COMMENT
LOW⚡	rdagent/utils/env.py	959	# Step 2: Install flash-attn (requires torch first, uses --no-build-isolation)	COMMENT
LOW⚡	rdagent/utils/agent/ret.py	97	# Step 1: extract patch by pattern	COMMENT
LOW⚡	rdagent/utils/agent/ret.py	103	# Step 2: apply the patch, this will modify the file in place	COMMENT
LOW	…nt/components/coder/data_science/pipeline/prompts.yaml	302	### Step 6: Similar Successful Implementations to help Code Improvement	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	70	# Step 1: Parameter filtering	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	73	# Step 2: Inject required parameters for multi-task environments	COMMENT
LOW⚡	rdagent/components/coder/finetune/unified_validator.py	76	# Step 3: Micro-batch testing (validates everything at runtime)	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	87	# Step 1: Run complete sampling/filtering (fast, no LLM) - runs in BOTH modes	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	90	# Step 2: Limit LLM processing in debug mode only	COMMENT
LOW⚡	rdagent/components/coder/finetune/prompts.yaml	96	# Step 3: Show the actual number of sampled items (Do not estimate; count the exact number of samples that will be p	COMMENT
LOW	rdagent/components/coder/finetune/eval.py	68	# Step 1: Check script exists	COMMENT
LOW	rdagent/components/coder/finetune/eval.py	82	# Step 3: Execute script in DEBUG mode (generates ~10 samples for fast validation)	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	101	# Step 4: Validate output	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	111	# Step 5: Load data if valid	COMMENT
LOW⚡	rdagent/components/coder/finetune/eval.py	120	# Step 6: Generate LLM feedback	COMMENT
LOW⚡	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	401	# Step 2: Generate the rest of the hypothesis & task	COMMENT
LOW⚡	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1458	# Step 3: Select the best hypothesis	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	347	# Step 1: Generate component	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1367	# Step 1: Identify problems	COMMENT
LOW	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	1391	# Step 2: Propose hypothesis based on the identified problems (and sampled ideas)	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	205	# Step 1: If we have fewer traces than our target, start a new one.	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	210	# Step 2: Probabilistically select a leaf to expand.	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	359	# Step 1: keep same policy to reach target number of parallel traces	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	364	# Step 2: consider only available leaves (not being expanded)	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	369	# Step 3: compute priors (P) from potentials via softmax	COMMENT
LOW⚡	…arios/data_science/proposal/exp_gen/trace_scheduler.py	377	# Step 4: score each leaf using PUCT-like rule: Q + U	COMMENT
LOW	…arios/data_science/proposal/exp_gen/trace_scheduler.py	391	# # Step 5: optimistic visit update on selection; value update deferred to observe_feedback	COMMENT
LOW	…data_science/proposal/exp_gen/draft/prompts_draft.yaml	220	# Step 2: Workflow Update	COMMENT
LOW	…scenarios/data_science/proposal/exp_gen/draft/draft.py	246	# Step 0: Prepare	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	271	# Step 1: Retrieve Knowledge	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	274	# Step 2: Generate Hypothesis based on General Knowledge	COMMENT
LOW⚡	…scenarios/data_science/proposal/exp_gen/draft/draft.py	282	# Step 3: Design Task	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	12	## Step 0: Pre-definition	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	17	## Step 1: Benchmark Metrics Evaluation (HIGHEST PRIORITY)	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	30	## Step 2: Code Quality Assessment	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	34	## Step 3: Final Decision (Acceptance as SOTA)	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	190	## Step 1: Error Classification	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	197	## Step 2: Root Cause Analysis	COMMENT
LOW⚡	rdagent/scenarios/finetune/dev/prompts.yaml	202	## Step 3: Actionable Suggestions	COMMENT

Self-Referential Comments24 hits · 69 pts

Severity	File	Line	Snippet	Context
MEDIUM	test/utils/coder/test_finetune_coder.py	14	# Create the ensemble task with actual data context and specification	COMMENT
MEDIUM⚡	test/oai/test_embedding_and_similarity.py	31	# Create a very long text that will definitely exceed embedding token limits	COMMENT
MEDIUM	rdagent/app/CI/run.py	773	# Create a table to display the counts and ratios	COMMENT
MEDIUM	rdagent/utils/agent/apply_patch.py	2	# The following code is modified from https://cookbook.openai.com/examples/gpt4-1_prompting_guide	COMMENT
MEDIUM	rdagent/utils/workflow/tracking.py	22	# Define a placeholder for mlflow if it's not available	COMMENT
MEDIUM	rdagent/components/proposal/__init__.py	22	# The following methods are scenario related so they should be implemented in the subclass	COMMENT
MEDIUM	rdagent/components/coder/factor_coder/evaluators.py	21	"""This class is the v1 version of evaluator for a single factor implementation.	STRING
MEDIUM	rdagent/components/coder/factor_coder/eva_utils.py	399	# Initialize result variables	COMMENT
MEDIUM	rdagent/components/coder/CoSTEER/evaluators.py	126	"""This class is a base class for all code generator feedback to single implementation"""	STRING
MEDIUM	rdagent/components/coder/data_science/ensemble/test.py	39	# Create the ensemble task with actual data context and specification	COMMENT
MEDIUM	rdagent/components/coder/data_science/model/test.py	23	# Create the task	COMMENT
MEDIUM	rdagent/components/coder/data_science/feature/test.py	22	# Create the experiment	COMMENT
MEDIUM	…/components/coder/data_science/raw_data_loader/test.py	19	# Create the experiment	COMMENT
MEDIUM	rdagent/components/coder/data_science/share/notebook.py	89	# Create a markdown cell for the section name and comments	COMMENT
MEDIUM	rdagent/components/coder/data_science/share/notebook.py	98	# Create a code cell for the section code and output	COMMENT
MEDIUM⚡	rdagent/scenarios/kaggle/experiment/prompts.yaml	241	# Define the forward pass	COMMENT
MEDIUM	…xperiment/templates/digit-recognizer/model/model_nn.py	11	# Define the neural network model with Batch Normalization	COMMENT
MEDIUM	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	918	# Create a random but reproducible integer	COMMENT
MEDIUM	…enarios/data_science/proposal/exp_gen/select/submit.py	452	# Create a temporary workspace to test the generated script	COMMENT
MEDIUM	rdagent/scenarios/data_science/debug/data.py	365	# Create a sampled subset	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/benchmark.py	183	# Create a temporary environment for merging (use FT env as it has peft/transformers)	COMMENT
MEDIUM	rdagent/log/ui/ds_trace.py	917	# Create a color map for different root nodes - using colors that work well in both light and dark modes	COMMENT
MEDIUM	rdagent/log/ui/app.py	1038	# Create a results table	STRING
MEDIUM	rdagent/log/utils/__init__.py	40	# This method is called too frequently, which is not good.	COMMENT

Redundant / Tautological Comments37 hits · 57 pts

Severity	File	Line	Snippet	Context
LOW	rdagent/app/finetune/llm/ui/ft_summary.py	338	# Check if it's a numeric score (with optional "/" separator)	COMMENT
LOW	rdagent/app/finetune/llm/ui/app.py	44	# Check if standalone task (has __session__ directly)	COMMENT
LOW	rdagent/app/finetune/llm/ui/app.py	47	# Check if job directory (subdirs have __session__)	COMMENT
LOW	rdagent/app/finetune/llm/ui/components.py	322	# Check if markdown rendering is enabled	COMMENT
LOW	rdagent/utils/env.py	1250	# Check if specific GPUs are requested via CUDA_VISIBLE_DEVICES	COMMENT
LOW	rdagent/components/coder/factor_coder/eva_utils.py	407	# Check if both dataframe has only one columns Mute this since factor task might generate more than one columns	COMMENT
LOW	rdagent/components/coder/factor_coder/eva_utils.py	422	# Check if the index of the dataframe is ("datetime", "instrument")	COMMENT
LOW	rdagent/components/coder/data_science/pipeline/eval.py	78	# Check if error_message contains Context7 documentation results	COMMENT
LOW	rdagent/components/coder/data_science/pipeline/eval.py	242	# Check if scores contain NaN (values)	COMMENT
LOW	rdagent/components/coder/data_science/workflow/eval.py	113	# Check if scores contain NaN (values)	COMMENT
LOW	rdagent/components/coder/data_science/share/eval.py	126	# Check if scores contain NaN (values)	COMMENT
LOW	rdagent/components/coder/data_science/share/eval.py	167	# Check if the content has changed	COMMENT
LOW	rdagent/components/coder/finetune/__init__.py	71	# Check if proposal decided to skip data processing (reuse SOTA's data processing script)	COMMENT
LOW	rdagent/components/coder/model_coder/benchmark/eval.py	69	# Check if it is not a good evaluation!!	COMMENT
LOW⚡	…narios/qlib/experiment/factor_template/read_exp_res.py	31	# Check if the recorder has a valid end time	COMMENT
LOW⚡	…narios/qlib/experiment/factor_template/read_exp_res.py	40	# Check if the latest recorder is found	COMMENT
LOW⚡	…enarios/qlib/experiment/model_template/read_exp_res.py	31	# Check if the recorder has a valid end time	COMMENT
LOW⚡	…enarios/qlib/experiment/model_template/read_exp_res.py	40	# Check if the latest recorder is found	COMMENT
LOW	rdagent/scenarios/kaggle/developer/feedback.py	59	# Check if there are any based experiments	COMMENT
LOW	rdagent/scenarios/kaggle/experiment/prompts.yaml	164	params = ... # Set parameters to XGBoost model	CODE
LOW	…riment/templates/meta_tpl_deprecated/model/model_nn.py	8	# Check if a GPU is available	COMMENT
LOW	…a_science/example/eval/playground-series-s4e9/valid.py	3	# Check if our submission file exists	COMMENT
LOW	…nce/example/eval/arf-12-hours-prediction-task/valid.py	3	# Check if our submission file exists	COMMENT
LOW	…arios/data_science/proposal/exp_gen/trace_scheduler.py	292	# Check if this experiment was successful (decision=True)	COMMENT
LOW	…enarios/data_science/proposal/exp_gen/select/expand.py	83	# Check if we've reached the maximum number of traces	COMMENT
LOW	…enarios/data_science/proposal/exp_gen/select/expand.py	131	# Check if we've reached the maximum number of traces	COMMENT
LOW	…enarios/data_science/proposal/exp_gen/select/expand.py	191	# Check if we've reached the maximum number of traces before creating a new one	COMMENT
LOW	rdagent/scenarios/data_science/debug/data.py	403	# Check if each file is in the "used" list	COMMENT
LOW	rdagent/scenarios/finetune/scen/utils.py	558	# Check if tokenizer supports <think> token for CoT training	COMMENT
LOW	rdagent/scenarios/finetune/scen/scenario.py	246	# Check if already configured	COMMENT
LOW	rdagent/scenarios/finetune/benchmark/benchmark.py	179	# Check if we need to merge the model (e.g. vLLM doesn't support LoRA with modules_to_save)	COMMENT
LOW	rdagent/scenarios/finetune/benchmark/benchmark.py	253	# Check if results already exist (skip re-running if cached)	COMMENT
LOW	rdagent/scenarios/finetune/benchmark/data/default.py	87	# Check if it's conversation format	COMMENT
LOW	rdagent/scenarios/finetune/train/eval.py	97	# Check if FT_YAML_FILE_NAME exists	COMMENT
LOW	rdagent/log/ui/utils.py	912	# Check if G is a path (a single line)	COMMENT
LOW	rdagent/log/ui/app.py	588	# # Check if metric series exists and has the matching round	COMMENT
LOW	rdagent/log/ui/app.py	912	# Display results	STRING

Slop Phrases25 hits · 48 pts

Severity	File	Line	Snippet	Context
MEDIUM	rdagent/components/coder/factor_coder/factor.py	144	# TODO you can change the name of the data folder for a better understanding	COMMENT
MEDIUM⚡	rdagent/scenarios/kaggle/experiment/prompts.yaml	247	optimizer = torch.optim.Adam(model.parameters(), lr=0.01) # Example optimizer, you can use any optimizer	CODE
MEDIUM⚡	rdagent/scenarios/kaggle/experiment/prompts.yaml	248	criterion = torch.nn.CrossEntropyLoss() # Example loss function, you can use any loss function	CODE
MEDIUM⚡	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1036	"output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i	CODE
MEDIUM⚡	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1037	"output_2":"Making escargot is a delightful culinary experience, and it involves a few key steps. Here\u2019s a basi	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	17	"output_2":"The names of the states in the United States are derived from various historical and cultural factors, b	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	125	"output_2":"As an AI language model, I don't have personal experiences or emotions, but I can provide you with some	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	148	"output_1":"Certainly! Canjeero, also known as Anjero, Laxoox or Somali pancake, is a traditional Somali dish simila	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	149	"output_2":"Certainly! Canjeero is a traditional Somali dish that is known for its rich, flavorful broth and hearty	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	197	"output_2":"As an AI language model, I don't have personal experiences or beliefs, but I can provide you with some g	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	221	"output_2":"As an AI language model, I don't have the ability to perceive or feel physical sensations, including the	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	281	"output_2":"As an AI language model, I don't have a physical form or a personal experience, so I don't become an aut	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	436	"output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	473	"output_2":"As an AI language model, I don't have personal experiences or emotions, but I can provide some general i	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	497	"output_2":"As an AI language model, I don't have the ability to physically use a phone, but I can provide you with	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	533	"output_2":"As an AI language model, I don't have personal beliefs or emotions, but I can provide some general infor	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	557	"output_2":"As an AI language model, I don't have personal beliefs or opinions, but I can provide some general infor	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	568	"output_1":"Certainly! Mine Frite, which means \"fried noodles\" in English, is a popular street food dish in Maurit	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	748	"output_1":"Certainly! Canjeero, also known as Anjero, Laxoox or Somali pancake, is a traditional Somali dish simila	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1061	"output_2":"Of course! Tahini is a delicious and versatile ingredient that can be used in many dishes. Here's a simp	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1168	"output_1":"Certainly! Mine Frite, which means \"fried noodles\" in English, is a popular street food dish in Maurit	CODE
LOW⚡	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1036	"output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i	CODE
LOW	…val/annotators_gpt52_fn/annotations_seed0_configs.json	232	"output_1":"Roasting a whole pig is a traditional and festive way to celebrate for many cultures, and it can be an e	CODE
LOW	…val/annotators_gpt52_fn/annotations_seed0_configs.json	436	"output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i	CODE
LOW	…val/annotators_gpt52_fn/annotations_seed0_configs.json	832	"output_1":"Roasting a whole pig is a traditional and festive way to celebrate for many cultures, and it can be an e	CODE

Modern AI Meta-Vocabulary15 hits · 48 pts

Severity	File	Line	Snippet	Context
MEDIUM	README.md	233	# EMBEDDING MODEL: Using SiliconFlow for embedding since deepseek has no embedding model.	COMMENT
MEDIUM	test/finetune/test_benchmark.py	194	# PANORAMA - Patent Analysis (zero-shot)	COMMENT
MEDIUM	test/finetune/test_benchmark_api.py	145	# Limit few-shot examples to avoid index out of range	COMMENT
MEDIUM	test/finetune/test_benchmark_api.py	441	# PANORAMA - Patent Analysis (zero-shot)	COMMENT
MEDIUM	rdagent/core/evolving_agent.py	147	# 1. RAG	COMMENT
MEDIUM	rdagent/app/kaggle/conf.py	54	# Conditionally set the knowledge_base based on the use of graph RAG	COMMENT
MEDIUM	rdagent/app/data_science/conf.py	146	# constrains the number of SOTA experiments to retrieve, otherwise too many SOTA experiments to retrieve will cause	COMMENT
MEDIUM	rdagent/oai/utils/embedding.py	12	# Common embedding model token limits	COMMENT
MEDIUM⚡	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	399	# - after we know the selected component, we can use RAG.	COMMENT
MEDIUM	…nt/scenarios/data_science/proposal/exp_gen/proposal.py	334	# - Extra RAG	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/default.py	92	# Multi-turn few-shot: format with ChatML-style markers	COMMENT
MEDIUM	rdagent/scenarios/finetune/benchmark/data/adaptor.py	56	# PANORAMA - Patent Analysis Benchmarks (zero-shot)	COMMENT
MEDIUM	rdagent/scenarios/finetune/datasets/financeiq/split.py	35	# FinanceIQ specific: 'dev' folder is small and used for few-shot.	COMMENT
MEDIUM⚡	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	71	# 任务类型 → few-shot prompt key 的映射	COMMENT
MEDIUM	…/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py	279	# --- 加载 ReAct few-shot prompts ---	COMMENT

Cross-Language Confusion6 hits · 38 pts

Severity	File	Line	Snippet	Context
HIGH	rdagent/components/agent/context7/conf.py	10	bun i && bun run build	STRING
HIGH	rdagent/scenarios/qlib/developer/utils.py	54	if all(first_values.equals(values) for values in candidate_values[1:]):	CODE
HIGH⚡	rdagent/scenarios/shared/get_runtime_info.py	22	strace_check = implementation.execute(env=env, entry="which strace \|\| echo MISSING").strip()	CODE
HIGH⚡	rdagent/scenarios/shared/get_runtime_info.py	27	coverage_check = implementation.execute(env=env, entry="python -m coverage --version \|\| echo MISSING").strip()	CODE
HIGH	rdagent/log/mle_summary.py	226	# "score": null,	COMMENT
HIGH⚡	rdagent/log/ui/st_fixed_container.py	34	let lastBackgroundColor = null;	CODE

Docstring Block Structure6 hits · 30 pts

Severity	File	Line	Snippet	Context
HIGH	rdagent/utils/env.py	70	Extract the first directory component from a relative path string. This is used to get the basename from path	STRING
HIGH	rdagent/components/coder/CoSTEER/evaluators.py	56	Validates and converts the 'final_decision' field in the given data dictionary. Args: data	STRING
HIGH	…/kaggle/tpl_ex/aerial-cactus-identification/feature.py	11	Perform feature engineering on the input data. Parameters: - X: np.ndarray The input data to be tr	STRING
HIGH	rdagent/scenarios/data_science/scen/utils.py	339	Generate a tree structure of files in a directory. Args: path: Target directory path	STRING
HIGH	rdagent/scenarios/finetune/scen/utils.py	831	Scan datasets directory and return top-level dataset names not yet in existing_config. Only scans first-level direc	STRING
HIGH	rdagent/scenarios/rl/autorl_bench/core/server.py	114	提交模型评测 Args: model_path: 模型路径 gpu: 指定 GPU（如 "0", "1", "0,1"），必须是 CUDA_VISIBLE_	STRING

AI Slop Vocabulary10 hits · 27 pts

Severity	File	Line	Snippet	Context
MEDIUM	test/notebook/test_util.py	1667	# Defensive import; fallback to the most robust method for v1.4.15	COMMENT
MEDIUM	test/notebook/testfiles/main_missing_sections.py	89	# Defensive import; fallback to the most robust method for v1.4.15	COMMENT
MEDIUM	test/notebook/testfiles/main_missing_main_fn.py	89	# Defensive import; fallback to the most robust method for v1.4.15	COMMENT
MEDIUM	test/notebook/testfiles/main.py	89	# Defensive import; fallback to the most robust method for v1.4.15	COMMENT
MEDIUM	rdagent/scenarios/data_science/loop.py	72	# rsync is more robust choice, but it is not installed in some docker images.	COMMENT
MEDIUM	rdagent/scenarios/finetune/train/eval.py	153	# Combine data processing and training stdout for comprehensive feedback	COMMENT
MEDIUM	rdagent/scenarios/finetune/train/eval.py	214	# Build comprehensive result with training metrics and benchmark results	COMMENT
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	460	"output_1":"Certainly! Tahini is a paste made from sesame seeds and is quite easy to make at home. You just need ses	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	845	"output_2":"Certainly! When choosing an electric saw, it's important to consider factors such as the type of saw you	CODE
MEDIUM	…val/annotators_gpt52_fn/annotations_seed0_configs.json	1060	"output_1":"Certainly! Tahini is a paste made from sesame seeds and is quite easy to make at home. You just need ses	CODE

Over-Commented Block25 hits · 25 pts

Severity	File	Line	Snippet	Context
LOW	test/finetune/test_benchmark.py	181	results_summary = {}	COMMENT
LOW	web/src/utils/snap.svg-min.js	1	// Snap.svg 0.5.0	COMMENT
LOW	web/src/router/index.ts	41	// },	COMMENT
LOW	web/src/router/index.ts	61	// query: { redirect: to.fullPath }	COMMENT
LOW	rdagent/app/finetune/llm/job/run_ft_job.sh	1	#!/bin/bash	COMMENT
LOW	rdagent/components/loader/task_loader.py	21	# self.json_uri = json_uri	COMMENT
LOW	rdagent/components/loader/task_loader.py	41		COMMENT
LOW	…agent/components/coder/CoSTEER/knowledge_management.py	781	constraint_labels=["task_trace"],	COMMENT
LOW	rdagent/components/coder/CoSTEER/evaluators.py	21	# 2. If it proves to be useful, relocate it to a more general location.	COMMENT
LOW	rdagent/components/coder/CoSTEER/evaluators.py	301	task_li_feedback_li.append(multi_implementation_feedback)	COMMENT
LOW	rdagent/components/coder/data_science/ensemble/conf.py	1	# Configuration file for ensemble component	COMMENT
LOW	rdagent/scenarios/kaggle/kaggle_crawler.py	401	"statoil-iceberg-classifier-challenge",	COMMENT
LOW	rdagent/scenarios/data_science/scen/utils.py	361	except Exception as e:	COMMENT
LOW	rdagent/scenarios/data_science/scen/utils.py	381	#	COMMENT
LOW	…rios/data_science/proposal/exp_gen/planner/__init__.py	41	# elif DS_RD_SETTING.merge_hours > 0:	COMMENT
LOW	…/rl/autorl_bench/benchmarks/humaneval/requirements.txt	1	# HumanEval benchmark 额外依赖	COMMENT
LOW	…os/rl/autorl_bench/benchmarks/webshop/requirements.txt	1	# WebShop benchmark 依赖	COMMENT
LOW	…narios/rl/autorl_bench/benchmarks/deepsearchqa/eval.py	201	# for step in range(max_steps):	COMMENT
LOW	…narios/rl/autorl_bench/benchmarks/deepsearchqa/eval.py	221	# observation = search_fn(action_content)	COMMENT
LOW	rdagent/log/mle_summary.py	221	pd.to_pickle(stat, save_p)	COMMENT
LOW	rdagent/log/ui/llm_st.py	121		COMMENT
LOW	rdagent/log/ui/llm_st.py	141	# st.json(cxt)	COMMENT
LOW	rdagent/log/ui/llm_st.py	161	# rdict.pop("spec")	COMMENT
LOW	rdagent/log/ui/app.py	581	if mem := state.msgs[0]["load_experiment"]:	COMMENT
LOW	rdagent/log/ui/app.py	601	# 'Sharpe': float(f"{metric['1day.excess_return_with_cost.annualized_return'] / abs(metric['1day.exc	COMMENT

Modern Structural Boilerplate22 hits · 22 pts

Severity	File	Line	Snippet	Context
LOW	rdagent/core/proposal.py	200	def set_current_selection(self, selection: tuple[int, ...]) -> None:	CODE
LOW	rdagent/core/experiment.py	446	def set_user_instructions(self, user_instructions: UserInstructions \| None) -> None:	CODE
LOW	rdagent/core/utils.py	104	def set_seed(self, seed: int) -> None:	CODE
LOW	rdagent/app/finetune/llm/ui/benchmarks/__init__.py	62	__all__ = [	CODE
LOW	rdagent/utils/env.py	229	def update_stdout(self, stdout: str) -> None:	CODE
LOW	rdagent/utils/env.py	747	def _update_bin_path(self) -> None:	CODE
LOW	rdagent/utils/agent/__init__.py	3	__all__ = ["build_cls_from_json_with_retry"]	CODE
LOW	rdagent/utils/workflow/__init__.py	5	__all__ = ["LoopBase", "LoopMeta", "WorkflowTracker", "wait_retry"]	CODE
LOW	rdagent/components/benchmark/utils.py	10	logger = logging.getLogger(__name__)	CODE
LOW	rdagent/scenarios/kaggle/proposal/proposal.py	200	def update_reward_estimates(self, trace: Trace) -> None:	STRING
LOW	…nt/scenarios/data_science/proposal/exp_gen/__init__.py	3	__all__ = ["DSTrace"]	CODE
LOW	…t/scenarios/data_science/proposal/exp_gen/idea_pool.py	182	def update_pickled_problem(self, problems: Dict, pickled_problem_name: str) -> None:	CODE
LOW	rdagent/scenarios/data_science/proposal/exp_gen/base.py	88	def set_sota_exp_to_submit(self, exp: DSExperiment) -> None:	CODE
LOW	rdagent/scenarios/data_science/experiment/experiment.py	42	def set_local_selection(self, local_selection: tuple[int, ...]) -> None:	CODE
LOW	rdagent/scenarios/finetune/benchmark/__init__.py	3	__all__ = ["get_benchmark_ranges", "run_benchmark"]	CODE
LOW	rdagent/scenarios/rl/autorl_bench/core/__init__.py	63	__all__ = [	CODE
LOW	rdagent/scenarios/rl/autorl_bench/agents/__init__.py	3	__all__ = ["get_agent", "list_agents"]	CODE
LOW	…scenarios/rl/autorl_bench/benchmarks/smith/__init__.py	16	logger = logging.getLogger(__name__)	CODE
LOW	…enarios/rl/autorl_bench/benchmarks/webshop/__init__.py	6	__all__ = ["WebShopEvaluator", "download_train_data"]	CODE
LOW	rdagent/log/timer.py	63	def update_remain_time(self) -> None:	CODE
LOW	rdagent/log/conf.py	22	def set_ui_server_port(self, port: int \| None) -> None:	CODE
LOW	rdagent/log/logger.py	106	def set_storages_path(self, path: str \| Path) -> None:	CODE

AI Structural Patterns15 hits · 15 pts

Severity	File	Line	Context
LOW	test/finetune/test_benchmark_api.py	94	CODE
LOW	test/finetune/test_benchmark_api.py	180	CODE
LOW	rdagent/app/cli.py	150	CODE
LOW	rdagent/app/data_science/loop.py	13	CODE
LOW	rdagent/app/finetune/llm/loop.py	17	CODE
LOW	rdagent/oai/backend/base.py	625	CODE
LOW	rdagent/components/coder/CoSTEER/evaluators.py	128	CODE
LOW	…mponents/coder/model_coder/benchmark/gt_code/visnet.py	752	CODE
LOW	…mponents/coder/model_coder/benchmark/gt_code/visnet.py	1077	CODE
LOW	…ponents/coder/model_coder/benchmark/gt_code/gpsconv.py	59	CODE
LOW	rdagent/scenarios/kaggle/kaggle_crawler.py	284	CODE
LOW	…t/scenarios/kaggle/knowledge_management/vector_base.py	21	CODE
LOW	…t/scenarios/kaggle/knowledge_management/vector_base.py	280	CODE
LOW	rdagent/scenarios/data_science/proposal/exp_gen/base.py	14	CODE
LOW	rdagent/scenarios/finetune/train/eval.py	330	CODE

Dead Code7 hits · 14 pts

Severity	File	Line	Context
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	185	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	227	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	228	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	229	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	231	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	232	CODE
MEDIUM	…agent/components/coder/CoSTEER/knowledge_management.py	278	CODE

Magic Placeholder Names3 hits · 12 pts

Severity	File	Line	Snippet	Context
HIGH	rdagent/app/finetune/llm/README.md	58	OPENAI_API_KEY=<your_api_key>	CODE
HIGH	rdagent/scenarios/finetune/benchmark/benchmark.py	11	FT_JUDGE_API_KEY="<your_api_key>"	STRING
HIGH	rdagent/scenarios/rl/autorl_bench/README.md	69	OPENAI_API_KEY=your_api_key	CODE

Overly Generic Function Names7 hits · 7 pts

Severity	File	Line	Snippet	Context
LOW	test/notebook/test_util.py	997	"def process_data(data):",	CODE
LOW	test/notebook/test_util.py	1071	"def process_data(data):",	CODE
LOW	test/notebook/test_util.py	1155	"def process_data(data):",	CODE
LOW	test/notebook/test_util.py	1216	"def process_data(data):",	CODE
LOW	test/notebook/test_util.py	1240	"def process_data(data):",	CODE
LOW	test/notebook/test_util.py	1326	"def process_data(data):",	CODE
LOW	…l-iceberg-classifier-challenge/fea_share_preprocess.py	20	def process_data(df):	CODE

Example Usage Blocks2 hits · 2 pts

Severity	File	Line	Snippet	Context
LOW	rdagent/app/utils/ape.py	22	# Example usage	COMMENT
LOW	rdagent/log/base.py	29	# Usage:	STRING

TODO Padding1 hit · 2 pts

Severity	File	Line	Snippet	Context
LOW	…enarios/data_science/proposal/exp_gen/select/expand.py	244	# TODO: implement these selectors and more	COMMENT

Analysis Overview

What These Metrics Mean

Score History

Severity Breakdown

Directory Score Breakdown

Pattern Findings