Research and development (R&D) is crucial for enhancing industrial productivity, especially in the AI era, where R&D centers mainly on data and models. We are committed to automating these high-value, generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. 🔗 https://aka.ms/RD-Agent-Tech-Report
1351 matches across 18 categories.
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | …ent/components/coder/data_science/ensemble/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | …ent/components/coder/data_science/pipeline/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | …ent/components/coder/data_science/workflow/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | rdagent/components/coder/data_science/model/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | …gent/components/coder/data_science/feature/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | …ponents/coder/data_science/raw_data_loader/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | rdagent/scenarios/data_science/dev/runner/__init__.py | 0 | assign the code list to the evolving item. the code list is aligned with the evolving item's sub-tasks. if a task is not |
| HIGH | …ent/spaceship-titanic_template/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …mplates/playground-series-s4e8/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …/templates/meta_tpl_deprecated/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …lar-playground-series-dec-2021/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …york-city-taxi-fare-prediction/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …plates/playground-series-s3e26/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …/experiment/templates/sf-crime/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …s/forest-cover-type-prediction/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …nt/templates/spaceship-titanic/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …ent/templates/digit-recognizer/fea_share_preprocess.py | 0 | this method loads the data, drops the unnecessary columns, and splits it into train and validation sets. |
| HIGH | …ent/spaceship-titanic_template/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …mplates/playground-series-s4e8/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …/templates/meta_tpl_deprecated/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …plates/playground-series-s3e26/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …/experiment/templates/sf-crime/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …nt/templates/spaceship-titanic/fea_share_preprocess.py | 0 | fits the preprocessor on the training data and returns the fitted preprocessor. |
| HIGH | …ent/spaceship-titanic_template/fea_share_preprocess.py | 0 | transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across |
| HIGH | …/templates/meta_tpl_deprecated/fea_share_preprocess.py | 0 | transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across |
| HIGH | …nt/templates/spaceship-titanic/fea_share_preprocess.py | 0 | transforms the given dataframe using the fitted preprocessor. ensures the processed data has consistent features across |
| HIGH | …ent/spaceship-titanic_template/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …mplates/playground-series-s4e8/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …plates/playground-series-s3e14/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …/templates/meta_tpl_deprecated/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …ventilator-pressure-prediction/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …lar-playground-series-dec-2021/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …york-city-taxi-fare-prediction/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …plates/playground-series-s3e11/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …plates/playground-series-s3e16/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …plates/playground-series-s3e26/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …rize-english-language-learning/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …/experiment/templates/sf-crime/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …s/forest-cover-type-prediction/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …lar-playground-series-may-2022/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …nt/templates/spaceship-titanic/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …l-iceberg-classifier-challenge/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …ent/templates/digit-recognizer/fea_share_preprocess.py | 0 | this method applies the preprocessing steps to the training, validation, and test datasets. |
| HIGH | …paceship-titanic_template/model/select_randomforest.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …periment/spaceship-titanic_template/model/select_nn.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …nt/spaceship-titanic_template/model/select_lightgbm.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …ent/spaceship-titanic_template/model/select_xgboost.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …es/playground-series-s4e9/model/select_randomforest.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …nt/templates/playground-series-s4e9/model/select_nn.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …plates/playground-series-s4e9/model/select_lightgbm.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …mplates/playground-series-s4e9/model/select_xgboost.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …es/playground-series-s4e8/model/select_randomforest.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …nt/templates/playground-series-s4e8/model/select_nn.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …plates/playground-series-s4e8/model/select_lightgbm.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …mplates/playground-series-s4e8/model/select_xgboost.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …s/playground-series-s3e14/model/select_randomforest.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …t/templates/playground-series-s3e14/model/select_nn.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …lates/playground-series-s3e14/model/select_lightgbm.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …plates/playground-series-s3e14/model/select_xgboost.py | 0 | select relevant features. to be used in fit & predict function. |
| HIGH | …plates/meta_tpl_deprecated/model/model_randomforest.py | 0 | select relevant features. to be used in fit & predict function. |

152 more matches not shown…

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/utils/test_conf.py | 4 | |
| LOW | test/utils/test_env.py | 1 | |
| LOW | test/utils/test_import.py | 2 | |
| LOW | test/utils/env_tpl/read_exp.py | 2 | |
| LOW | test/utils/env_tpl/read_exp.py | 3 | |
| LOW | test/finetune/test_benchmark_tablebench.py | 6 | |
| LOW | test/finetune/test_benchmark.py | 10 | |
| LOW | test/finetune/test_benchmark_api.py | 10 | |
| LOW | test/oai/test_advanced.py | 7 | |
| LOW | test/oai/test_base.py | 1 | |
| LOW | test/oai/test_completion.py | 3 | |
| LOW | test/notebook/testfiles/main2.py | 5 | |
| LOW | test/notebook/testfiles/main2.py | 14 | |
| LOW | test/notebook/testfiles/main2.py | 16 | |
| LOW | rdagent/core/evolving_agent.py | 1 | |
| LOW | rdagent/core/conf.py | 1 | |
| LOW | rdagent/core/evolving_framework.py | 1 | |
| LOW | rdagent/core/proposal.py | 3 | |
| LOW | rdagent/core/experiment.py | 1 | |
| LOW | rdagent/core/utils.py | 1 | |
| LOW | rdagent/core/developer.py | 1 | |
| LOW | rdagent/core/interactor.py | 1 | |
| LOW | rdagent/app/cli.py | 9 | |
| LOW | rdagent/app/benchmark/factor/analysis.py | 9 | |
| LOW | rdagent/app/benchmark/model/eval.py | 14 | |
| LOW | rdagent/app/CI/run.py | 1 | |
| LOW | rdagent/app/utils/ws.py | 1 | |
| LOW | rdagent/app/utils/health_check.py | 6 | |
| LOW | rdagent/app/utils/info.py | 4 | |
| LOW | rdagent/app/utils/info.py | 9 | |
| LOW | rdagent/app/utils/ws_ft.py | 1 | |
| LOW | rdagent/app/utils/ws_ft.py | 5 | |
| LOW | rdagent/app/utils/ws_ft.py | 7 | |
| LOW | rdagent/app/finetune/data_science/scen.py | 4 | |
| LOW | rdagent/app/finetune/data_science/scen.py | 5 | |
| LOW | rdagent/app/finetune/data_science/scen.py | 8 | |
| LOW | rdagent/app/finetune/data_science/loop.py | 8 | |
| LOW | rdagent/app/finetune/data_science/loop.py | 9 | |
| LOW | rdagent/app/qlib_rd_loop/factor_from_report.py | 4 | |
| LOW | rdagent/app/qlib_rd_loop/factor_from_report.py | 15 | |
| LOW | rdagent/app/rl/ui/data_loader.py | 17 | |
| LOW | rdagent/utils/env.py | 14 | |
| LOW | rdagent/utils/env.py | 45 | |
| LOW | rdagent/utils/__init__.py | 21 | |
| LOW | rdagent/utils/agent/apply_patch.py | 9 | |
| LOW | rdagent/utils/agent/__init__.py | 1 | |
| LOW | rdagent/utils/agent/workflow.py | 2 | |
| LOW | rdagent/utils/agent/workflow.py | 2 | |
| LOW | rdagent/utils/workflow/__init__.py | 1 | |
| LOW | rdagent/utils/workflow/__init__.py | 1 | |
| LOW | rdagent/utils/workflow/__init__.py | 2 | |
| LOW | rdagent/utils/workflow/__init__.py | 3 | |
| LOW | rdagent/utils/workflow/loop.py | 21 | |
| LOW | rdagent/utils/workflow/tracking.py | 18 | |
| LOW | rdagent/utils/repo/repo_utils.py | 2 | |
| LOW | rdagent/oai/llm_utils.py | 1 | |
| LOW | rdagent/oai/llm_utils.py | 10 | |
| LOW | rdagent/oai/llm_conf.py | 1 | |
| LOW | rdagent/oai/backend/deprec.py | 2 | |
| LOW | rdagent/oai/backend/deprec.py | 4 | |

250 more matches not shown…

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/utils/test_import.py | 56 | except Exception as e: |
| MEDIUM | test/finetune/test_benchmark_tablebench.py | 122 | print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}") |
| MEDIUM | test/finetune/test_benchmark.py | 125 | print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}") |
| MEDIUM | test/finetune/test_benchmark_api.py | 296 | print(f"Error: {result.stdout[-2000:] if result.stdout else 'No output'}") |
| LOW | test/oai/test_llm_connectivity.py | 42 | except Exception as e: |
| LOW | test/notebook/test_util.py | 1475 | except Exception as e: |
| LOW | test/notebook/test_util.py | 1483 | except Exception as e: |
| MEDIUM | test/notebook/testfiles/main2.py | 56 | print(f"Error loading train.csv: {e}") |
| MEDIUM | test/notebook/testfiles/main2.py | 62 | print(f"Error listing train dir: {e}") |
| MEDIUM | test/notebook/testfiles/main2.py | 68 | print(f"Error listing test dir: {e}") |
| MEDIUM | test/notebook/testfiles/main2.py | 79 | print(f"Error reading sample_submission.csv: {e}") |
| MEDIUM | test/notebook/testfiles/main2.py | 127 | print(f"Error reading {filepath}: {e}") |
| MEDIUM | test/notebook/testfiles/main2.py | 372 | print("Error computing log_loss on val:", ex) |
| MEDIUM | test/notebook/testfiles/main2.py | 420 | print("Error computing log_loss on validation:", ex) |
| MEDIUM | test/notebook/testfiles/main2.py | 117 | def load_img_as_numpy_with_mask(filepath): |
| LOW | test/notebook/testfiles/main2.py | 55 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 61 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 67 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 78 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 126 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 198 | except Exception as e: |
| LOW | test/notebook/testfiles/main2.py | 370 | except Exception as ex: |
| LOW | test/notebook/testfiles/main2.py | 418 | except Exception as ex: |
| LOW | test/notebook/testfiles/main2.py | 446 | except Exception: |
| LOW | test/notebook/testfiles/main_missing_sections.py | 303 | except Exception as e: |
| LOW | test/notebook/testfiles/main_missing_sections.py | 311 | except Exception as e: |
| MEDIUM | test/notebook/testfiles/main_missing_sections.py | 300 | def main(): |
| LOW | test/notebook/testfiles/main_missing_main_fn.py | 303 | except Exception as e: |
| LOW | test/notebook/testfiles/main_missing_main_fn.py | 311 | except Exception as e: |
| LOW | test/notebook/testfiles/main.py | 305 | except Exception as e: |
| LOW | test/notebook/testfiles/main.py | 313 | except Exception as e: |
| LOW | rdagent/app/kaggle/loop.py | 86 | except Exception as e: |
| LOW | rdagent/app/kaggle/loop.py | 106 | except Exception as e: |
| LOW | rdagent/app/utils/health_check.py | 73 | except Exception as e: |
| LOW | rdagent/app/utils/health_check.py | 89 | except Exception as e: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 55 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 96 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 139 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 181 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 452 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 485 | except Exception: |
| LOW | rdagent/app/finetune/llm/ui/components.py | 225 | except Exception: |
| LOW | rdagent/app/rl/ui/rl_summary.py | 45 | except Exception: |
| MEDIUM | rdagent/utils/env.py | 948 | def prepare(self) -> None: |
| MEDIUM | rdagent/utils/env.py | 994 | def prepare(self) -> None: |
| LOW | rdagent/utils/env.py | 110 | except Exception as cleanup_error: |
| LOW | rdagent/utils/env.py | 346 | except Exception as e: |
| LOW | rdagent/utils/env.py | 863 | except Exception as e: |
| LOW | rdagent/utils/env.py | 887 | except Exception as exc: # pragma: no cover - best-effort helper |
| LOW | rdagent/utils/env.py | 971 | except Exception as e: |
| LOW | rdagent/utils/env.py | 1005 | except Exception as e: |
| LOW | rdagent/utils/env.py | 1405 | except Exception: |
| LOW | rdagent/utils/__init__.py | 83 | except Exception as e: |
| LOW | rdagent/utils/__init__.py | 172 | except Exception as e: |
| LOW | rdagent/utils/__init__.py | 181 | except Exception as e: |
| LOW | rdagent/utils/agent/workflow.py | 53 | except Exception as e: |
| LOW | rdagent/utils/workflow/misc.py | 40 | except Exception as e: |
| MEDIUM | rdagent/utils/workflow/misc.py | 41 | print(f"Error: {e}") |
| LOW | rdagent/utils/workflow/loop.py | 247 | except Exception as e: |
| LOW | rdagent/utils/workflow/loop.py | 556 | except Exception as ex: |

133 more matches not shown…

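
Most rows in the table above are `except Exception` handlers and `print(f"Error: ...")` calls. The report does not state the rule behind them, so the following is only a minimal sketch of one common alternative: catch the specific exceptions you expect and log them. The helper name `read_text_or_none` is hypothetical and is not part of RD-Agent.

```python
import logging
from pathlib import Path
from typing import Optional

logger = logging.getLogger(__name__)


def read_text_or_none(path: Path) -> Optional[str]:
    """Return the file's text, or None if it cannot be read (hypothetical helper)."""
    try:
        return path.read_text(encoding="utf-8")
    except (FileNotFoundError, PermissionError, UnicodeDecodeError) as exc:
        # Catch only the failures we expect here, and log them instead of print().
        logger.warning("Could not read %s: %s", path, exc)
        return None
```

Any other exception still propagates to the caller, so unexpected failures are not silently swallowed.
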
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/qlib/test_model_factor_proposal.py | 42 | def test_model_proposal_import(): |
| LOW | test/qlib/test_model_factor_proposal.py | 46 | def test_factor_proposal_import(): |
| LOW | test/utils/test_env.py | 146 | def test_cleanup_container_import(self): |
| LOW | test/utils/test_import.py | 17 | def import_all_modules_from_directory(directory): |
| LOW | test/utils/test_ws.py | 27 | def test_checkpoint_roundtrip(self) -> None: |
| LOW | test/utils/test_kaggle.py | 12 | def test_competition_template(self): |
| LOW | test/oai/test_embedding_and_similarity.py | 22 | def test_embedding_similarity(self) -> None: |
| LOW | test/oai/test_embedding_and_similarity.py | 29 | def test_embedding_long_text_truncation(self) -> None: |
| LOW | test/oai/test_advanced.py | 24 | def test_chat_cache_multiprocess(self) -> None: |
| LOW | test/oai/test_completion.py | 29 | def test_chat_completion_json_mode(self) -> None: |
| LOW | test/oai/test_completion.py | 41 | def test_build_messages_and_calculate_token(self) -> None: |
| LOW | test/oai/test_completion.py | 48 | def test_json_mode_with_specific_target_type(self) -> None: |
| LOW | test/oai/test_completion.py | 68 | def test_response_format_with_basemodel(self) -> None: |
| LOW | test/notebook/test_util.py | 143 | def test_happy_path_no_header(self): |
| LOW | test/notebook/test_util.py | 268 | def test_ignores_indented_calls(self): |
| LOW | test/notebook/test_util.py | 356 | def test_happy_path_no_header(self): |
| LOW | test/notebook/test_util.py | 477 | def test_ignore_unknown_section(self): |
| LOW | test/notebook/test_util.py | 548 | def test_happy_path_multiline(self): |
| LOW | test/notebook/test_util.py | 600 | def test_arbitrary_print_happy_path(self): |
| LOW | test/notebook/test_util.py | 695 | def test_happy_path_with_args(self): |
| LOW | test/notebook/test_util.py | 703 | def test_happy_path_with_args_multiline(self): |
| LOW | test/notebook/test_util.py | 740 | def test_function_does_not_exist(self): |
| LOW | test/notebook/test_util.py | 807 | def test_happy_path_arbitrary_content(self): |
| LOW | test/notebook/test_util.py | 820 | def test_block_does_not_exist(self): |
| LOW | test/notebook/test_notebook_converter.py | 10 | def normalize_nb_json_for_comparison(nb_json_str): |
| LOW | test/notebook/test_notebook_converter.py | 30 | def test_validation_missing_main_fn(self): |
| LOW | test/notebook/test_notebook_converter.py | 39 | def test_validation_missing_sections(self): |
| LOW | test/notebook/test_notebook_converter.py | 96 | def test_argparse_with_dupe_sys(self): |
| LOW | test/notebook/testfiles/main2.py | 117 | def load_img_as_numpy_with_mask(filepath): |
| LOW | rdagent/core/conf.py | 16 | def settings_customise_sources( |
| LOW | rdagent/core/evolving_framework.py | 148 | def load_or_init_knowledge_base( |
| LOW | rdagent/core/evolving_framework.py | 183 | def load_dumped_knowledge_base(self, *args: Any, **kwargs: Any) -> None: |
| LOW | rdagent/core/proposal.py | 178 | def get_sota_hypothesis_and_experiment(self) -> tuple[Hypothesis | None, Experiment | None]: |
| LOW | rdagent/core/experiment.py | 209 | def link_all_files_in_folder_to_workspace(data_path: Path, workspace_path: Path) -> None: |
| LOW | rdagent/core/experiment.py | 275 | def inject_code_from_file_dict(self, workspace: FBWorkspace) -> None: |
| LOW | rdagent/app/utils/health_check.py | 37 | def check_and_list_free_ports(start_port=19899, max_ports=10) -> None: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 457 | def extract_baseline_full_benchmark(task_path: Path, split: str = "validation") -> dict | None: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 490 | def get_task_full_benchmark_df(task_path: Path, split: str) -> pd.DataFrame: |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 554 | def render_task_detail_selector(job_path: Path) -> None: |
| LOW | rdagent/app/finetune/llm/ui/components.py | 17 | def convert_latex_for_streamlit(text: str) -> str: |
| LOW | rdagent/app/finetune/data_science/scen.py | 14 | def _get_data_folder_description(self) -> str: |
| LOW | rdagent/app/general_model/general_model.py | 14 | def extract_models_and_implement(report_file_path: str) -> None: |
| LOW | rdagent/app/qlib_rd_loop/factor_from_report.py | 61 | def extract_hypothesis_and_exp_from_reports(report_file_path: str) -> QlibFactorExperiment | None: |
| LOW | rdagent/utils/env.py | 69 | def extract_dir_name_from_path_config(path_str: str) -> str: |
| LOW | rdagent/utils/env.py | 172 | def get_workspace_content_for_hash(self, local_path: str | Path) -> list[list[str]]: |
| LOW | rdagent/utils/env.py | 275 | def unzip_a_file_into_a_folder( |
| LOW | rdagent/utils/env.py | 537 | def dump_python_code_run_and_get_results( |
| LOW | rdagent/utils/env.py | 811 | def populate_exclude_chmod_paths(self) -> "DockerConf": |
| LOW | rdagent/utils/env.py | 877 | def _sync_conda_cache_with_real_envs() -> None: |
| LOW | rdagent/utils/env.py | 1117 | def get_workspace_content_for_hash(self, local_path: str | Path) -> list[list[str]]: |
| LOW | rdagent/utils/__init__.py | 28 | def get_module_by_module_path(module_path: Union[str, ModuleType]) -> ModuleType: |
| LOW | rdagent/utils/__init__.py | 193 | def remove_path_info_from_str(base_path: Path, target_string: str) -> str: |
| LOW | rdagent/utils/agent/workflow.py | 10 | def build_cls_from_json_with_retry( |
| LOW | rdagent/utils/workflow/loop.py | 172 | def _check_exit_conditions_on_step(self, loop_id: Optional[int] = None, step_id: Optional[int] = None) -> None: |
| LOW | rdagent/oai/llm_utils.py | 13 | def calculate_embedding_distance_between_str_list( |
| LOW | rdagent/oai/backend/deprec.py | 273 | def _create_embedding_inner_function(self, input_content_list: list[str]) -> list[list[float]]: |
| LOW | rdagent/oai/backend/deprec.py | 294 | def _create_chat_completion_inner_function( # type: ignore[no-untyped-def] # noqa: C901, PLR0912, PLR0915 |
| LOW | rdagent/oai/backend/deprec.py | 467 | def _calculate_token_from_messages(self, messages: list[dict[str, Any]]) -> int: |
| LOW | rdagent/oai/backend/litellm.py | 60 | def _calculate_token_from_messages(self, messages: list[dict[str, Any]]) -> int: |
| LOW | rdagent/oai/backend/litellm.py | 71 | def _create_embedding_inner_function(self, input_content_list: list[str]) -> list[list[float]]: |

139 more matches not shown…

| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | rdagent/utils/__init__.py | 108 | \d+/\d+\s+[━]+\s+\d+s?\s+\d+ms/step.*?\u0008+ | # e.g. "10/100 ━━━━━━ 3s 50ms/step" |
| MEDIUM | rdagent/utils/__init__.py | 109 | \d+/\d+\s+[━]+\s+\d+s?\s+\d+ms/step | # e.g. "10/100 ━━━━━━ 3s 50ms/step" (no backspaces) |
| MEDIUM | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 134 | # ====================================================================================== |
| MEDIUM | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 136 | # ====================================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 38 | # ============================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 40 | # ============================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 516 | # ============================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 518 | # ============================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 634 | # ============================================================================== |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 636 | # ============================================================================== |
| MEDIUM | rdagent/scenarios/finetune/benchmark/data/default.py | 17 | # ============================================================================ |
| MEDIUM | rdagent/scenarios/finetune/benchmark/data/default.py | 19 | # ============================================================================ |
| MEDIUM | rdagent/scenarios/finetune/benchmark/data/default.py | 203 | # ============================================================================ |
| MEDIUM | rdagent/scenarios/finetune/benchmark/data/default.py | 205 | # ============================================================================ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 1 | # ============================================================================= |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 3 | # ============================================================================= |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 18 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 20 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 163 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 165 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 223 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/finetune/proposal/prompts.yaml | 225 | # ═══════════════════════════════════════════════════════════════════════════ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/server.py | 356 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/server.py | 358 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 83 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 85 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 98 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 100 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 173 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 175 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 238 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 240 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 272 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 274 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 346 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 348 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 386 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/utils.py | 388 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/evaluator.py | 16 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/evaluator.py | 18 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/evaluator.py | 49 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/core/evaluator.py | 51 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 35 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 37 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 228 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 230 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 313 | # ============================================================ |
| MEDIUM | rdagent/scenarios/rl/autorl_bench/test/test_fixes.py | 315 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 67 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 69 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 121 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 123 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 212 | # ============================================================ |
| MEDIUM | …/scenarios/rl/autorl_bench/benchmarks/alfworld/eval.py | 214 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 56 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 58 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 148 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 150 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 261 | # ============================================================ |
| MEDIUM | …t/scenarios/rl/autorl_bench/benchmarks/webshop/eval.py | 263 | # ============================================================ |
| 6 more matches not shown… | | | |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/notebook/testfiles/main2.py | 37 | |
| LOW | rdagent/core/evolving_agent.py | 140 | |
| LOW | rdagent/core/experiment.py | 324 | |
| LOW | rdagent/core/experiment.py | 350 | |
| LOW | rdagent/app/CI/run.py | 182 | |
| LOW | rdagent/app/CI/run.py | 432 | |
| LOW | rdagent/app/CI/run.py | 185 | |
| LOW | rdagent/app/finetune/llm/ui/data_loader.py | 98 | |
| LOW | rdagent/app/finetune/llm/ui/data_loader.py | 365 | |
| LOW | rdagent/app/finetune/llm/ui/data_loader.py | 423 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 27 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 73 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 101 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 224 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 429 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 457 | |
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 490 | |
| LOW | rdagent/app/finetune/llm/ui/app.py | 28 | |
| LOW | rdagent/app/finetune/llm/ui/app.py | 66 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 42 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 58 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 208 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 313 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 461 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 602 | |
| LOW | rdagent/app/finetune/llm/ui/components.py | 665 | |
| LOW | rdagent/app/finetune/llm/ui/benchmarks/bioprobench.py | 21 | |
| LOW | rdagent/app/finetune/llm/ui/benchmarks/__init__.py | 22 | |
| LOW | rdagent/app/finetune/llm/ui/benchmarks/chemcotbench.py | 47 | |
| LOW | rdagent/app/qlib_rd_loop/factor_from_report.py | 112 | |
| LOW | rdagent/app/rl/ui/data_loader.py | 237 | |
| LOW | rdagent/app/rl/ui/data_loader.py | 285 | |
| LOW | rdagent/app/rl/ui/rl_summary.py | 74 | |
| LOW | rdagent/app/rl/ui/app.py | 46 | |
| LOW | rdagent/app/rl/ui/app.py | 75 | |
| LOW | rdagent/app/rl/ui/components.py | 24 | |
| LOW | rdagent/app/rl/ui/components.py | 38 | |
| LOW | rdagent/app/rl/ui/components.py | 211 | |
| LOW | rdagent/utils/env.py | 139 | |
| LOW | rdagent/utils/env.py | 275 | |
| LOW | rdagent/utils/env.py | 593 | |
| LOW | rdagent/utils/env.py | 1162 | |
| LOW | rdagent/utils/env.py | 1308 | |
| LOW | rdagent/utils/env.py | 617 | |
| LOW | rdagent/utils/__init__.py | 100 | |
| LOW | rdagent/utils/agent/apply_patch.py | 275 | |
| LOW | rdagent/utils/agent/apply_patch.py | 385 | |
| LOW | rdagent/utils/agent/apply_patch.py | 457 | |
| LOW | rdagent/utils/agent/apply_patch.py | 173 | |
| LOW | rdagent/utils/agent/tpl.py | 33 | |
| LOW | rdagent/utils/workflow/loop.py | 194 | |
| LOW | rdagent/utils/workflow/loop.py | 313 | |
| LOW | rdagent/oai/backend/deprec.py | 109 | |
| LOW | rdagent/oai/backend/deprec.py | 294 | |
| LOW | rdagent/oai/backend/litellm.py | 95 | |
| LOW | rdagent/oai/backend/litellm.py | 127 | |
| LOW | rdagent/oai/backend/base.py | 520 | |
| LOW | rdagent/components/benchmark/eval_method.py | 200 | |
| LOW | rdagent/components/knowledge_management/graph.py | 197 | |
| LOW | rdagent/components/workflow/rd_loop.py | 75 | |
| 118 more matches not shown… | | | |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | rdagent/utils/env.py | 955 | # Step 1: Install base dependencies (torch, llamafactory, etc.) |
| LOW | rdagent/utils/env.py | 959 | # Step 2: Install flash-attn (requires torch first, uses --no-build-isolation) |
| LOW | rdagent/utils/agent/ret.py | 97 | # Step 1: extract patch by pattern |
| LOW | rdagent/utils/agent/ret.py | 103 | # Step 2: apply the patch, this will modify the file in place |
| LOW | …nt/components/coder/data_science/pipeline/prompts.yaml | 302 | ### Step 6: Similar Successful Implementations to help Code Improvement |
| LOW | rdagent/components/coder/finetune/unified_validator.py | 70 | # Step 1: Parameter filtering |
| LOW | rdagent/components/coder/finetune/unified_validator.py | 73 | # Step 2: Inject required parameters for multi-task environments |
| LOW | rdagent/components/coder/finetune/unified_validator.py | 76 | # Step 3: Micro-batch testing (validates everything at runtime) |
| LOW | rdagent/components/coder/finetune/prompts.yaml | 87 | # Step 1: Run complete sampling/filtering (fast, no LLM) - runs in BOTH modes |
| LOW | rdagent/components/coder/finetune/prompts.yaml | 90 | # Step 2: Limit LLM processing in debug mode only |
| LOW | rdagent/components/coder/finetune/prompts.yaml | 96 | # Step 3: Show the actual number of sampled items (Do not estimate; count the exact number of samples that will be p |
| LOW | rdagent/components/coder/finetune/eval.py | 68 | # Step 1: Check script exists |
| LOW | rdagent/components/coder/finetune/eval.py | 82 | # Step 3: Execute script in DEBUG mode (generates ~10 samples for fast validation) |
| LOW | rdagent/components/coder/finetune/eval.py | 101 | # Step 4: Validate output |
| LOW | rdagent/components/coder/finetune/eval.py | 111 | # Step 5: Load data if valid |
| LOW | rdagent/components/coder/finetune/eval.py | 120 | # Step 6: Generate LLM feedback |
| LOW | …cenarios/data_science/proposal/exp_gen/prompts_v2.yaml | 803 | ### Step 2: **Workflow Update** : |
| LOW | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 347 | # Step 1: Generate component |
| LOW | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 401 | # Step 2: Generate the rest of the hypothesis & task |
| LOW | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 1367 | # Step 1: Identify problems |
| LOW | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 1391 | # Step 2: Propose hypothesis based on the identified problems (and sampled ideas) |
| LOW | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 1458 | # Step 3: Select the best hypothesis |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 205 | # Step 1: If we have fewer traces than our target, start a new one. |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 210 | # Step 2: Probabilistically select a leaf to expand. |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 359 | # Step 1: keep same policy to reach target number of parallel traces |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 364 | # Step 2: consider only available leaves (not being expanded) |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 369 | # Step 3: compute priors (P) from potentials via softmax |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 377 | # Step 4: score each leaf using PUCT-like rule: Q + U |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 391 | # # Step 5: optimistic visit update on selection; value update deferred to observe_feedback |
| LOW | …data_science/proposal/exp_gen/draft/prompts_draft.yaml | 220 | # Step 2: Workflow Update |
| LOW | …scenarios/data_science/proposal/exp_gen/draft/draft.py | 246 | # Step 0: Prepare |
| LOW | …scenarios/data_science/proposal/exp_gen/draft/draft.py | 271 | # Step 1: Retrieve Knowledge |
| LOW | …scenarios/data_science/proposal/exp_gen/draft/draft.py | 274 | # Step 2: Generate Hypothesis based on General Knowledge |
| LOW | …scenarios/data_science/proposal/exp_gen/draft/draft.py | 282 | # Step 3: Design Task |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 12 | ## Step 0: Pre-definition |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 17 | ## Step 1: Benchmark Metrics Evaluation (HIGHEST PRIORITY) |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 30 | ## Step 2: Code Quality Assessment |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 34 | ## Step 3: Final Decision (Acceptance as SOTA) |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 190 | ## Step 1: Error Classification |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 197 | ## Step 2: Root Cause Analysis |
| LOW | rdagent/scenarios/finetune/dev/prompts.yaml | 202 | ## Step 3: Actionable Suggestions |

| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | test/utils/coder/test_finetune_coder.py | 14 | # Create the ensemble task with actual data context and specification |
| MEDIUM | test/oai/test_embedding_and_similarity.py | 31 | # Create a very long text that will definitely exceed embedding token limits |
| MEDIUM | rdagent/app/CI/run.py | 773 | # Create a table to display the counts and ratios |
| MEDIUM | rdagent/utils/agent/apply_patch.py | 2 | # The following code is modified from https://cookbook.openai.com/examples/gpt4-1_prompting_guide |
| MEDIUM | rdagent/utils/workflow/tracking.py | 22 | # Define a placeholder for mlflow if it's not available |
| MEDIUM | rdagent/components/proposal/__init__.py | 22 | # The following methods are scenario related so they should be implemented in the subclass |
| MEDIUM | rdagent/components/coder/factor_coder/evaluators.py | 21 | """This class is the v1 version of evaluator for a single factor implementation. |
| MEDIUM | rdagent/components/coder/factor_coder/eva_utils.py | 399 | # Initialize result variables |
| MEDIUM | rdagent/components/coder/CoSTEER/evaluators.py | 126 | """This class is a base class for all code generator feedback to single implementation""" |
| MEDIUM | rdagent/components/coder/data_science/ensemble/test.py | 39 | # Create the ensemble task with actual data context and specification |
| MEDIUM | rdagent/components/coder/data_science/model/test.py | 23 | # Create the task |
| MEDIUM | rdagent/components/coder/data_science/feature/test.py | 22 | # Create the experiment |
| MEDIUM | …/components/coder/data_science/raw_data_loader/test.py | 19 | # Create the experiment |
| MEDIUM | rdagent/components/coder/data_science/share/notebook.py | 89 | # Create a markdown cell for the section name and comments |
| MEDIUM | rdagent/components/coder/data_science/share/notebook.py | 98 | # Create a code cell for the section code and output |
| MEDIUM | rdagent/scenarios/kaggle/experiment/prompts.yaml | 241 | # Define the forward pass |
| MEDIUM | …xperiment/templates/digit-recognizer/model/model_nn.py | 11 | # Define the neural network model with Batch Normalization |
| MEDIUM | …nt/scenarios/data_science/proposal/exp_gen/proposal.py | 918 | # Create a random but reproducible integer |
| MEDIUM | …enarios/data_science/proposal/exp_gen/select/submit.py | 452 | # Create a temporary workspace to test the generated script |
| MEDIUM | rdagent/scenarios/data_science/debug/data.py | 365 | # Create a sampled subset |
| MEDIUM | rdagent/scenarios/finetune/benchmark/benchmark.py | 183 | # Create a temporary environment for merging (use FT env as it has peft/transformers) |
| MEDIUM | rdagent/log/ui/ds_trace.py | 917 | # Create a color map for different root nodes - using colors that work well in both light and dark modes |
| MEDIUM | rdagent/log/ui/app.py | 1038 | # Create a results table |
| MEDIUM | rdagent/log/utils/__init__.py | 40 | # This method is called too frequently, which is not good. |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | rdagent/app/finetune/llm/ui/ft_summary.py | 338 | # Check if it's a numeric score (with optional "/" separator) |
| LOW | rdagent/app/finetune/llm/ui/app.py | 44 | # Check if standalone task (has __session__ directly) |
| LOW | rdagent/app/finetune/llm/ui/app.py | 47 | # Check if job directory (subdirs have __session__) |
| LOW | rdagent/app/finetune/llm/ui/components.py | 322 | # Check if markdown rendering is enabled |
| LOW | rdagent/utils/env.py | 1250 | # Check if specific GPUs are requested via CUDA_VISIBLE_DEVICES |
| LOW | rdagent/components/coder/factor_coder/eva_utils.py | 407 | # Check if both dataframe has only one columns Mute this since factor task might generate more than one columns |
| LOW | rdagent/components/coder/factor_coder/eva_utils.py | 422 | # Check if the index of the dataframe is ("datetime", "instrument") |
| LOW | rdagent/components/coder/data_science/pipeline/eval.py | 78 | # Check if error_message contains Context7 documentation results |
| LOW | rdagent/components/coder/data_science/pipeline/eval.py | 242 | # Check if scores contain NaN (values) |
| LOW | rdagent/components/coder/data_science/workflow/eval.py | 113 | # Check if scores contain NaN (values) |
| LOW | rdagent/components/coder/data_science/share/eval.py | 126 | # Check if scores contain NaN (values) |
| LOW | rdagent/components/coder/data_science/share/eval.py | 167 | # Check if the content has changed |
| LOW | rdagent/components/coder/finetune/__init__.py | 71 | # Check if proposal decided to skip data processing (reuse SOTA's data processing script) |
| LOW | rdagent/components/coder/model_coder/benchmark/eval.py | 69 | # Check if it is not a good evaluation!! |
| LOW | …narios/qlib/experiment/factor_template/read_exp_res.py | 31 | # Check if the recorder has a valid end time |
| LOW | …narios/qlib/experiment/factor_template/read_exp_res.py | 40 | # Check if the latest recorder is found |
| LOW | …enarios/qlib/experiment/model_template/read_exp_res.py | 31 | # Check if the recorder has a valid end time |
| LOW | …enarios/qlib/experiment/model_template/read_exp_res.py | 40 | # Check if the latest recorder is found |
| LOW | rdagent/scenarios/kaggle/developer/feedback.py | 59 | # Check if there are any based experiments |
| LOW | rdagent/scenarios/kaggle/experiment/prompts.yaml | 164 | params = ... # Set parameters to XGBoost model |
| LOW | …riment/templates/meta_tpl_deprecated/model/model_nn.py | 8 | # Check if a GPU is available |
| LOW | …a_science/example/eval/playground-series-s4e9/valid.py | 3 | # Check if our submission file exists |
| LOW | …nce/example/eval/arf-12-hours-prediction-task/valid.py | 3 | # Check if our submission file exists |
| LOW | …arios/data_science/proposal/exp_gen/trace_scheduler.py | 292 | # Check if this experiment was successful (decision=True) |
| LOW | …enarios/data_science/proposal/exp_gen/select/expand.py | 83 | # Check if we've reached the maximum number of traces |
| LOW | …enarios/data_science/proposal/exp_gen/select/expand.py | 131 | # Check if we've reached the maximum number of traces |
| LOW | …enarios/data_science/proposal/exp_gen/select/expand.py | 191 | # Check if we've reached the maximum number of traces before creating a new one |
| LOW | rdagent/scenarios/data_science/debug/data.py | 403 | # Check if each file is in the "used" list |
| LOW | rdagent/scenarios/finetune/scen/utils.py | 558 | # Check if tokenizer supports <think> token for CoT training |
| LOW | rdagent/scenarios/finetune/scen/scenario.py | 246 | # Check if already configured |
| LOW | rdagent/scenarios/finetune/benchmark/benchmark.py | 179 | # Check if we need to merge the model (e.g. vLLM doesn't support LoRA with modules_to_save) |
| LOW | rdagent/scenarios/finetune/benchmark/benchmark.py | 253 | # Check if results already exist (skip re-running if cached) |
| LOW | rdagent/scenarios/finetune/benchmark/data/default.py | 87 | # Check if it's conversation format |
| LOW | rdagent/scenarios/finetune/train/eval.py | 97 | # Check if FT_YAML_FILE_NAME exists |
| LOW | rdagent/log/ui/utils.py | 912 | # Check if G is a path (a single line) |
| LOW | rdagent/log/ui/app.py | 588 | # # Check if metric series exists and has the matching round |
| LOW | rdagent/log/ui/app.py | 912 | # Display results |

| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | rdagent/components/coder/factor_coder/factor.py | 144 | # TODO you can change the name of the data folder for a better understanding |
| MEDIUM | rdagent/scenarios/kaggle/experiment/prompts.yaml | 247 | optimizer = torch.optim.Adam(model.parameters(), lr=0.01) # Example optimizer, you can use any optimizer |
| MEDIUM | rdagent/scenarios/kaggle/experiment/prompts.yaml | 248 | criterion = torch.nn.CrossEntropyLoss() # Example loss function, you can use any loss function |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1036 | "output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1037 | "output_2":"Making escargot is a delightful culinary experience, and it involves a few key steps. Here\u2019s a basi |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 17 | "output_2":"The names of the states in the United States are derived from various historical and cultural factors, b |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 125 | "output_2":"As an AI language model, I don't have personal experiences or emotions, but I can provide you with some |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 148 | "output_1":"Certainly! Canjeero, also known as Anjero, Laxoox or Somali pancake, is a traditional Somali dish simila |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 149 | "output_2":"Certainly! Canjeero is a traditional Somali dish that is known for its rich, flavorful broth and hearty |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 197 | "output_2":"As an AI language model, I don't have personal experiences or beliefs, but I can provide you with some g |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 221 | "output_2":"As an AI language model, I don't have the ability to perceive or feel physical sensations, including the |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 281 | "output_2":"As an AI language model, I don't have a physical form or a personal experience, so I don't become an aut |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 436 | "output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 473 | "output_2":"As an AI language model, I don't have personal experiences or emotions, but I can provide some general i |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 497 | "output_2":"As an AI language model, I don't have the ability to physically use a phone, but I can provide you with |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 533 | "output_2":"As an AI language model, I don't have personal beliefs or emotions, but I can provide some general infor |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 557 | "output_2":"As an AI language model, I don't have personal beliefs or opinions, but I can provide some general infor |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 568 | "output_1":"Certainly! Mine Frite, which means \"fried noodles\" in English, is a popular street food dish in Maurit |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 748 | "output_1":"Certainly! Canjeero, also known as Anjero, Laxoox or Somali pancake, is a traditional Somali dish simila |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1061 | "output_2":"Of course! Tahini is a delicious and versatile ingredient that can be used in many dishes. Here's a simp |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1168 | "output_1":"Certainly! Mine Frite, which means \"fried noodles\" in English, is a popular street food dish in Maurit |
| LOW | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1036 | "output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i |
| LOW | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 232 | "output_1":"Roasting a whole pig is a traditional and festive way to celebrate for many cultures, and it can be an e |
| LOW | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 436 | "output_1":"Escargot is a classic French delicacy made from cooked land snails. It is often served as an appetizer i |
| LOW | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 832 | "output_1":"Roasting a whole pig is a traditional and festive way to celebrate for many cultures, and it can be an e |

| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | rdagent/components/agent/context7/conf.py | 10 | bun i && bun run build |
| HIGH | rdagent/scenarios/qlib/developer/utils.py | 54 | if all(first_values.equals(values) for values in candidate_values[1:]): |
| HIGH | rdagent/scenarios/shared/get_runtime_info.py | 22 | strace_check = implementation.execute(env=env, entry="which strace || echo MISSING").strip() |
| HIGH | rdagent/scenarios/shared/get_runtime_info.py | 27 | coverage_check = implementation.execute(env=env, entry="python -m coverage --version || echo MISSING").strip() |
| HIGH | rdagent/log/mle_summary.py | 226 | # "score": null, |
| HIGH | rdagent/log/ui/st_fixed_container.py | 34 | let lastBackgroundColor = null; |

| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | rdagent/utils/env.py | 70 | Extract the first directory component from a relative path string. This is used to get the basename from path |
| HIGH | rdagent/components/coder/CoSTEER/evaluators.py | 56 | Validates and converts the 'final_decision' field in the given data dictionary. Args: data |
| HIGH | …/kaggle/tpl_ex/aerial-cactus-identification/feature.py | 11 | Perform feature engineering on the input data. Parameters: - X: np.ndarray The input data to be tr |
| HIGH | rdagent/scenarios/data_science/scen/utils.py | 339 | Generate a tree structure of files in a directory. Args: path: Target directory path |
| HIGH | rdagent/scenarios/finetune/scen/utils.py | 831 | Scan datasets directory and return top-level dataset names not yet in existing_config. Only scans first-level direc |
| HIGH | rdagent/scenarios/rl/autorl_bench/core/server.py | 114 | Submit a model for evaluation Args: model_path: path to the model gpu: GPU(s) to use (e.g. "0", "1", "0,1"); must be CUDA_VISIBLE_ |

| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | test/notebook/test_util.py | 1667 | # Defensive import; fallback to the most robust method for v1.4.15 |
| MEDIUM | test/notebook/testfiles/main_missing_sections.py | 89 | # Defensive import; fallback to the most robust method for v1.4.15 |
| MEDIUM | test/notebook/testfiles/main_missing_main_fn.py | 89 | # Defensive import; fallback to the most robust method for v1.4.15 |
| MEDIUM | test/notebook/testfiles/main.py | 89 | # Defensive import; fallback to the most robust method for v1.4.15 |
| MEDIUM | rdagent/scenarios/data_science/loop.py | 72 | # rsync is more robust choice, but it is not installed in some docker images. |
| MEDIUM | rdagent/scenarios/finetune/train/eval.py | 153 | # Combine data processing and training stdout for comprehensive feedback |
| MEDIUM | rdagent/scenarios/finetune/train/eval.py | 214 | # Build comprehensive result with training metrics and benchmark results |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 460 | "output_1":"Certainly! Tahini is a paste made from sesame seeds and is quite easy to make at home. You just need ses |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 845 | "output_2":"Certainly! When choosing an electric saw, it's important to consider factors such as the type of saw you |
| MEDIUM | …val/annotators_gpt52_fn/annotations_seed0_configs.json | 1060 | "output_1":"Certainly! Tahini is a paste made from sesame seeds and is quite easy to make at home. You just need ses |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/finetune/test_benchmark.py | 181 | results_summary = {} |
| LOW | web/src/utils/snap.svg-min.js | 1 | // Snap.svg 0.5.0 |
| LOW | web/src/router/index.ts | 41 | // }, |
| LOW | web/src/router/index.ts | 61 | // query: { redirect: to.fullPath } |
| LOW | rdagent/app/finetune/llm/job/run_ft_job.sh | 1 | #!/bin/bash |
| LOW | rdagent/components/loader/task_loader.py | 21 | # self.json_uri = json_uri |
| LOW | rdagent/components/loader/task_loader.py | 41 | |
| LOW | …agent/components/coder/CoSTEER/knowledge_management.py | 781 | constraint_labels=["task_trace"], |
| LOW | rdagent/components/coder/CoSTEER/evaluators.py | 21 | # 2. If it proves to be useful, relocate it to a more general location. |
| LOW | rdagent/components/coder/CoSTEER/evaluators.py | 301 | task_li_feedback_li.append(multi_implementation_feedback) |
| LOW | rdagent/components/coder/data_science/ensemble/conf.py | 1 | # Configuration file for ensemble component |
| LOW | rdagent/scenarios/kaggle/kaggle_crawler.py | 401 | "statoil-iceberg-classifier-challenge", |
| LOW | rdagent/scenarios/data_science/scen/utils.py | 361 | except Exception as e: |
| LOW | rdagent/scenarios/data_science/scen/utils.py | 381 | # |
| LOW | …rios/data_science/proposal/exp_gen/planner/__init__.py | 41 | # elif DS_RD_SETTING.merge_hours > 0: |
| LOW | …/rl/autorl_bench/benchmarks/humaneval/requirements.txt | 1 | # HumanEval benchmark extra dependencies |
| LOW | …os/rl/autorl_bench/benchmarks/webshop/requirements.txt | 1 | # WebShop benchmark dependencies |
| LOW | …narios/rl/autorl_bench/benchmarks/deepsearchqa/eval.py | 201 | # for step in range(max_steps): |
| LOW | …narios/rl/autorl_bench/benchmarks/deepsearchqa/eval.py | 221 | # observation = search_fn(action_content) |
| LOW | rdagent/log/mle_summary.py | 221 | pd.to_pickle(stat, save_p) |
| LOW | rdagent/log/ui/llm_st.py | 121 | |
| LOW | rdagent/log/ui/llm_st.py | 141 | # st.json(cxt) |
| LOW | rdagent/log/ui/llm_st.py | 161 | # rdict.pop("spec") |
| LOW | rdagent/log/ui/app.py | 581 | if mem := state.msgs[0]["load_experiment"]: |
| LOW | rdagent/log/ui/app.py | 601 | # 'Sharpe': float(f"{metric['1day.excess_return_with_cost.annualized_return'] / abs(metric['1day.exc |

| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 185 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 227 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 228 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 229 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 231 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 232 | |
| MEDIUM | …agent/components/coder/CoSTEER/knowledge_management.py | 278 | |

| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | rdagent/app/finetune/llm/README.md | 58 | OPENAI_API_KEY=<your_api_key> |
| HIGH | rdagent/scenarios/finetune/benchmark/benchmark.py | 11 | FT_JUDGE_API_KEY="<your_api_key>" |
| HIGH | rdagent/scenarios/rl/autorl_bench/README.md | 69 | OPENAI_API_KEY=your_api_key |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | test/notebook/test_util.py | 997 | "def process_data(data):", |
| LOW | test/notebook/test_util.py | 1071 | "def process_data(data):", |
| LOW | test/notebook/test_util.py | 1155 | "def process_data(data):", |
| LOW | test/notebook/test_util.py | 1216 | "def process_data(data):", |
| LOW | test/notebook/test_util.py | 1240 | "def process_data(data):", |
| LOW | test/notebook/test_util.py | 1326 | "def process_data(data):", |
| LOW | …l-iceberg-classifier-challenge/fea_share_preprocess.py | 20 | def process_data(df): |

| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | rdagent/app/utils/ape.py | 22 | # Example usage |
| LOW | rdagent/log/base.py | 29 | # Usage: |