huggingface/sentence-transformers

26.2

Adjusted Score

26.2

Raw Score

100%

Time Factor

2026-07-13

Last Push

18.9K

Stars

Python

Language

111.9K

Lines of Code

590

Files

2.2K

Pattern Hits

2026-07-14

Scan Date

0.23

HC Hit Rate

What These Metrics Mean

Adjusted Score: Primary synthetic code indicator. Raw score normalised per 1,000 lines of code and multiplied by the temporal discount factor. This is the definitive comparative metric — use it to rank repositories by AI authorship density.
Raw Score: The unmodified sum of all severity-weighted, context-multiplied pattern match scores before temporal discounting. Reflects the absolute signal strength independent of when the repository was last active.
Time Factor: The temporal discount multiplier (0–100%) applied to the raw score. Repositories last updated before ChatGPT's launch (Nov 2022) receive a 5% factor. Full signal is only assigned to repositories active in the post-adoption era (Jan 2024+).
Pattern Hits: Total count of individual pattern matches across all files and categories. A high hit count with a low score may indicate a very large codebase with isolated AI snippets; a low count with a high score indicates dense, concentrated AI signatures.
HC Hit Rate: High+Critical pattern hits per file, averaged across the repository. This orthogonal signal catches repositories where a few files are densely packed with high-severity AI tells — a strong indicator even when the normalised score appears moderate due to codebase size.
Lines of Code / Files: Total lines and files analysed. The scanner examines 94 file extensions. These denominators are used to normalise the score, enabling fair comparison between repositories of vastly different sizes.

Score History

This chart maps the temporal evolution of the adjusted synthetic code score across successive scan runs. An upward trajectory indicates ongoing incorporation of AI-generated code or expanding LLM-assisted scaffolding; a stable or declining trajectory may reflect active human refactoring, code removal, or the adoption of stricter authorship policies. The dashed secondary line (right axis) independently tracks total raw pattern hit count, which can diverge from the normalised score when codebase size changes significantly between scans.

Severity Breakdown

Classifies detected patterns by their diagnostic confidence and structural impact. CRITICAL patterns (coefficient 10) represent definitive synthetic signatures — hallucinated imports, explicit LLM attribution metadata — virtually never produced by human authors. HIGH (5) indicates strong structural tells such as cross-file repetition or cross-linguistic idioms. MEDIUM (2) covers recognisable conversational padding and AI-specific vocabulary. LOW (1) captures subtle indicators like tautological comments and generic boilerplate that require density to carry independent signal.

CRITICAL 0HIGH 133MEDIUM 124LOW 1991

Directory Score Breakdown

This horizontal bar chart decomposes the repository's raw synthetic code score by top-level directory, allowing you to pinpoint precisely which modules or components carry the highest AI authorship density. Directories with disproportionately high scores relative to their size warrant targeted manual review: concentrated AI signatures often trace back to mass-generated configuration layers, auto-ported test suites, LLM-scaffolded boilerplate classes, or entire subsystems authored under heavy copilot assistance. Use this view to prioritise your human code-review effort.

Pattern Findings

The scanner identified 2248 distinct pattern matches across 17 syntactic categories. Each entry below represents a discrete location in the source code where the engine recorded a statistically significant AI authorship indicator. Expand any category row to inspect the individual file paths, line numbers, code snippets, and the lexical context (CODE, COMMENT, or STRING) in which each match was detected.

Reading the findings table: The Severity column indicates the diagnostic confidence level (CRITICAL / HIGH / MEDIUM / LOW). The Context column identifies whether the match occurred inside executable code, an inline comment, or a string literal — comment-context matches receive a ×1.5 weight because LLMs systematically over-annotate. The ⚡ bolt icon marks clustered matches: three or more patterns within a 10-line window, each receiving an additional ×1.5 density multiplier as dense clusters constitute far stronger evidence of synthetic authorship than isolated hits.

Hyper-Verbose Identifiers914 hits · 844 pts

Severity	File	Line	Snippet	Context
LOW	…entence_transformer/deprecated_model_card_templates.py	125	def model_card_get_pooling_function(pooling_mode):	STRING
LOW	sentence_transformers/sentence_transformer/model.py	931	def set_pooling_include_prompt(self, include_prompt: bool) -> None:	CODE
LOW	sentence_transformers/sentence_transformer/model.py	977	def get_sentence_embedding_dimension(self) -> int \| None:	CODE
LOW	sentence_transformers/sentence_transformer/model.py	1015	def truncate_sentence_embeddings(self, truncate_dim: int \| None) -> Iterator[None]:	CODE
LOW	…ntence_transformers/sentence_transformer/model_card.py	129	def get_model_specific_metadata(self) -> dict[str, Any]:	CODE
LOW	…sentence_transformer/losses/batch_semi_hard_triplet.py	110	def batch_semi_hard_triplet_loss(self, labels: Tensor, embeddings: Tensor) -> Tensor:	CODE
LOW	…tence_transformer/losses/multiple_negatives_ranking.py	237	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…ence_transformers/sentence_transformer/losses/angle.py	83	def compute_loss_from_embeddings(	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	436	def calculate_loss_and_cache_gradients(self, reps: list[list[Tensor]]) -> Tensor:	CODE
LOW	…transformer/losses/global_orthogonal_regularization.py	139	def compute_loss_from_embeddings(	CODE
LOW	…mers/sentence_transformer/losses/batch_hard_triplet.py	228	def get_anchor_positive_triplet_mask(labels: Tensor) -> Tensor:	CODE
LOW⚡	…mers/sentence_transformer/losses/batch_hard_triplet.py	247	def get_anchor_negative_triplet_mask(labels: Tensor) -> Tensor:	CODE
LOW	…transformers/sentence_transformer/losses/margin_mse.py	177	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…rmers/sentence_transformer/losses/cosine_similarity.py	83	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…/sentence_transformer/losses/denoising_auto_encoder.py	16	def _tie_encoder_decoder_weights(encoder: nn.Module, decoder: nn.Module) -> None:	CODE
LOW	…nce_transformers/sentence_transformer/losses/cosent.py	86	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…ce_transformers/sentence_transformer/losses/triplet.py	87	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…e_transformer/losses/batch_hard_soft_margin_triplet.py	99	def batch_hard_triplet_soft_margin_loss(self, labels: Tensor, embeddings: Tensor) -> Tensor:	CODE
LOW	…sformers/sentence_transformer/losses/distill_kl_div.py	141	def compute_loss_from_embeddings(self, embeddings: list[Tensor], labels: Tensor) -> Tensor:	CODE
LOW	…rmers/sentence_transformer/losses/cached_gist_embed.py	296	def calculate_loss_and_cache_gradients(self, reps: list[list[Tensor]], reps_guided: list[list[Tensor]]) -> Tensor:	CODE
LOW	…nsformers/sentence_transformer/losses/embed_distill.py	196	def compute_loss_from_embeddings(self, embeddings: list[Tensor], teacher_embeddings: list[Tensor]) -> Tensor:	CODE
LOW	…ntence_transformer/evaluation/binary_classification.py	316	def find_best_acc_and_threshold(scores, labels, high_score_more_similar: bool):	CODE
LOW	…nsformers/sentence_transformer/evaluation/reranking.py	291	def compute_metrics_individual(self, model: SentenceTransformer):	CODE
LOW	…nsformers/sentence_transformer/evaluation/nano_beir.py	468	def _load_dataset_subset_split(self, subset: str, split: str, required_columns: list[str]):	CODE
LOW	…nsformers/sentence_transformer/evaluation/nano_beir.py	523	def store_metrics_in_model_card_data(self, args, *kwargs):	CODE
LOW	…e_transformers/sentence_transformer/modules/pooling.py	26	def _convert_legacy_pooling_kwargs(kwargs: dict[str, Any]) -> None:	CODE
LOW	…e_transformers/sentence_transformer/modules/pooling.py	46	def _deprecated_pooling_mode_kwargs(func):	CODE
LOW	…e_transformers/sentence_transformer/modules/pooling.py	151	def _exclude_prompt_from_mask(attention_mask: Tensor, prompt_length: int) -> Tensor:	CODE
LOW	…ransformers/sentence_transformer/modules/clip_model.py	36	def _get_default_modality_config(config: dict[str, Any]) -> tuple[ModalityConfig, str]:	CODE
LOW	sentence_transformers/util/deprecated_import.py	261	def setup_deprecated_module_imports() -> None:	CODE
LOW	sentence_transformers/util/similarity.py	299	def to_similarity_pairwise_fn(	CODE
LOW	sentence_transformers/util/retrieval.py	89	def paraphrase_mining_embeddings(	CODE
LOW	sentence_transformers/util/environment.py	28	def suggest_extra_on_exception() -> Generator[None, None, None]:	CODE
LOW	sentence_transformers/util/environment.py	83	def check_package_availability(package_name: str, owner: str) -> bool:	CODE
LOW	sentence_transformers/util/file_io.py	34	def is_sentence_transformer_model(	CODE
LOW	sentence_transformers/util/decorators.py	48	def transformer_kwargs_decorator(func):	CODE
LOW	sentence_transformers/util/decorators.py	92	def cross_encoder_init_args_decorator(func):	CODE
LOW	sentence_transformers/util/decorators.py	165	def cross_encoder_predict_rank_args_decorator(func):	CODE
LOW	sentence_transformers/util/decorators.py	190	def save_to_hub_args_decorator(func):	CODE
LOW	sentence_transformers/backend/optimize.py	19	def export_optimized_onnx_model(	CODE
LOW	sentence_transformers/backend/quantize.py	24	def export_dynamic_quantized_onnx_model(	CODE
LOW	sentence_transformers/backend/quantize.py	107	def export_static_quantized_openvino_model(	CODE
LOW	sentence_transformers/backend/utils.py	155	def save_or_push_to_hub_model(	CODE
LOW	sentence_transformers/cross_encoder/model.py	406	def get_default_activation_fn(self) -> Callable:	CODE
LOW	sentence_transformers/cross_encoder/model.py	467	def default_activation_function(self) -> Callable:	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	397	def smart_batching_collate_text_only(self, batch: list[InputExample]) -> BatchEncoding:	CODE
LOW	sentence_transformers/cross_encoder/model_card.py	269	def get_model_specific_metadata(self) -> dict[str, Any]:	CODE
LOW	…ss_encoder/losses/cached_multiple_negatives_ranking.py	239	def calculate_loss_and_cache_gradients(self, logits: list[Tensor], batch_size: int) -> Tensor:	CODE
LOW	…nce_transformers/cross_encoder/evaluation/nano_beir.py	389	def _load_dataset_subset_split(self, subset: str, split: str, required_columns: list[str]):	CODE
LOW	sentence_transformers/sparse_encoder/model.py	602	def set_pooling_include_prompt(self, include_prompt: bool) -> None:	CODE
LOW	sentence_transformers/sparse_encoder/model.py	830	def get_sentence_embedding_dimension(self) -> int \| None:	CODE
LOW	sentence_transformers/sparse_encoder/model.py	1118	def splade_pooling_chunk_size(self) -> int \| None:	CODE
LOW	sentence_transformers/sparse_encoder/model.py	1139	def splade_pooling_chunk_size(self, value: int \| None) -> None:	CODE
LOW	sentence_transformers/sparse_encoder/model_card.py	118	def get_model_specific_metadata(self) -> dict[str, Any]:	CODE
LOW	sentence_transformers/sparse_encoder/search_engines.py	160	def semantic_search_elasticsearch(	CODE
LOW	sentence_transformers/sparse_encoder/search_engines.py	428	def semantic_search_opensearch(	CODE
LOW	…ce_transformers/sparse_encoder/losses/cached_splade.py	182	def calculate_loss_and_cache_gradients(self, reps: list[list[Tensor]], labels: Tensor \| None) -> Tensor:	CODE
LOW	sentence_transformers/sparse_encoder/losses/flops.py	47	def compute_loss_from_embeddings(self, embeddings: torch.Tensor) -> torch.Tensor:	CODE
LOW	sentence_transformers/sparse_encoder/losses/csr.py	15	def normalized_mean_squared_error(reconstruction: torch.Tensor, original_input: torch.Tensor) -> torch.Tensor:	CODE
LOW	sentence_transformers/sparse_encoder/losses/csr.py	68	def compute_loss_from_embeddings(self, outputs: list[dict[str, torch.Tensor]]) -> dict[str, torch.Tensor]:	CODE
854 more matches not shown…

Cross-File Repetition114 hits · 570 pts

Severity	File	Line	Snippet	Context
HIGH	…nce_transformers/sentence_transformer/training_args.py	0	basetrainingarguments extends :class:`~transformers.trainingarguments` with additional arguments specific to sentence tr	STRING
HIGH	sentence_transformers/sparse_encoder/training_args.py	0	basetrainingarguments extends :class:`~transformers.trainingarguments` with additional arguments specific to sentence tr	STRING
HIGH	sentence_transformers/base/training_args.py	0	basetrainingarguments extends :class:`~transformers.trainingarguments` with additional arguments specific to sentence tr	STRING
HIGH	sentence_transformers/sentence_transformer/trainer.py	0	basetrainer is a simple but feature-complete training and eval loop for pytorch based on the 🤗 transformers :class:`~tra	STRING
HIGH	sentence_transformers/cross_encoder/trainer.py	0	basetrainer is a simple but feature-complete training and eval loop for pytorch based on the 🤗 transformers :class:`~tra	STRING
HIGH	sentence_transformers/sparse_encoder/trainer.py	0	basetrainer is a simple but feature-complete training and eval loop for pytorch based on the 🤗 transformers :class:`~tra	STRING
HIGH	sentence_transformers/base/trainer.py	0	basetrainer is a simple but feature-complete training and eval loop for pytorch based on the 🤗 transformers :class:`~tra	STRING
HIGH	…sentence_transformer/losses/batch_semi_hard_triplet.py	0	batchhardsoftmargintripletloss takes a batch with (input, label) pairs and computes the loss for all possible, valid tri	STRING
HIGH	…mers/sentence_transformer/losses/batch_hard_triplet.py	0	batchhardsoftmargintripletloss takes a batch with (input, label) pairs and computes the loss for all possible, valid tri	STRING
HIGH	…e_transformer/losses/batch_hard_soft_margin_triplet.py	0	batchhardsoftmargintripletloss takes a batch with (input, label) pairs and computes the loss for all possible, valid tri	STRING
HIGH	…sentence_transformer/losses/batch_semi_hard_triplet.py	0	@misc{hermans2017defense, title={in defense of the triplet loss for person re-identification}, author={alexander hermans	STRING
HIGH	…rmers/sentence_transformer/losses/batch_all_triplet.py	0	@misc{hermans2017defense, title={in defense of the triplet loss for person re-identification}, author={alexander hermans	STRING
HIGH	…mers/sentence_transformer/losses/batch_hard_triplet.py	0	@misc{hermans2017defense, title={in defense of the triplet loss for person re-identification}, author={alexander hermans	STRING
HIGH	…ce_transformers/sentence_transformer/losses/triplet.py	0	@misc{hermans2017defense, title={in defense of the triplet loss for person re-identification}, author={alexander hermans	STRING
HIGH	…e_transformer/losses/batch_hard_soft_margin_triplet.py	0	@misc{hermans2017defense, title={in defense of the triplet loss for person re-identification}, author={alexander hermans	STRING
HIGH	…ransformer/losses/cached_multiple_negatives_ranking.py	0	random-state context manager class. reference: https://github.com/luyug/gradcache. this class will back up the pytorch's	STRING
HIGH	…rmers/sentence_transformer/losses/cached_gist_embed.py	0	random-state context manager class. reference: https://github.com/luyug/gradcache. this class will back up the pytorch's	STRING
HIGH	…ss_encoder/losses/cached_multiple_negatives_ranking.py	0	random-state context manager class. reference: https://github.com/luyug/gradcache. this class will back up the pytorch's	STRING
HIGH	…ransformer/losses/cached_multiple_negatives_ranking.py	0	a backward hook to backpropagate the cached gradients mini-batch by mini-batch.	STRING
HIGH	…rmers/sentence_transformer/losses/cached_gist_embed.py	0	a backward hook to backpropagate the cached gradients mini-batch by mini-batch.	STRING
HIGH	…ss_encoder/losses/cached_multiple_negatives_ranking.py	0	a backward hook to backpropagate the cached gradients mini-batch by mini-batch.	STRING
HIGH	…ransformer/losses/cached_multiple_negatives_ranking.py	0	do forward pass on all the minibatches of the input features and yield corresponding embeddings.	STRING
HIGH	…rmers/sentence_transformer/losses/cached_gist_embed.py	0	do forward pass on all the minibatches of the input features and yield corresponding embeddings.	STRING
HIGH	…ss_encoder/losses/cached_multiple_negatives_ranking.py	0	do forward pass on all the minibatches of the input features and yield corresponding embeddings.	STRING
HIGH	…ransformers/sentence_transformer/datasets/sentences.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…tence_transformer/datasets/no_duplicates_dataloader.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…transformers/sentence_transformer/datasets/__init__.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…_transformers/sentence_transformer/readers/__init__.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…ormers/sentence_transformer/datasets/sentence_label.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…rs/sentence_transformer/datasets/parallel_sentences.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…_transformers/sentence_transformer/readers/nli_data.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…_transformers/sentence_transformer/readers/sts_data.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…nsformers/sentence_transformer/readers/paired_files.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…sformers/sentence_transformer/readers/input_example.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…formers/sentence_transformer/readers/label_sentence.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…e_transformers/sentence_transformer/readers/triplet.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…entence_transformer/datasets/denoising_auto_encoder.py	0	this file contains deprecated code that can only be used with the old `model.fit`-style sentence transformers v2.x train	STRING
HIGH	…ransformers/sentence_transformer/modules/clip_model.py	0	get the default modality configuration for the current transformer task. returns: tuple[modalityconfig, str]: a tuple of	STRING
HIGH	…transformers/sparse_encoder/modules/mlm_transformer.py	0	get the default modality configuration for the current transformer task. returns: tuple[modalityconfig, str]: a tuple of	STRING
HIGH	sentence_transformers/base/modules/transformer.py	0	get the default modality configuration for the current transformer task. returns: tuple[modalityconfig, str]: a tuple of	STRING
HIGH	sentence_transformers/cross_encoder/losses/rank_net.py	0	get configuration parameters for this loss function. returns: dictionary containing the configuration parameters	STRING
HIGH	sentence_transformers/cross_encoder/losses/list_net.py	0	get configuration parameters for this loss function. returns: dictionary containing the configuration parameters	STRING
HIGH	sentence_transformers/cross_encoder/losses/plist_mle.py	0	get configuration parameters for this loss function. returns: dictionary containing the configuration parameters	STRING
HIGH	sentence_transformers/cross_encoder/losses/list_mle.py	0	get configuration parameters for this loss function. returns: dictionary containing the configuration parameters	STRING
HIGH	…tence_transformers/cross_encoder/losses/lambda_loss.py	0	get configuration parameters for this loss function. returns: dictionary containing the configuration parameters	STRING
HIGH	sentence_transformers/cross_encoder/losses/adr_mse.py	0	compute lambdaloss for a batch of queries and their documents. args: inputs: list of (queries, documents_list) labels: g	STRING
HIGH	sentence_transformers/cross_encoder/losses/list_net.py	0	compute lambdaloss for a batch of queries and their documents. args: inputs: list of (queries, documents_list) labels: g	STRING
HIGH	sentence_transformers/cross_encoder/losses/plist_mle.py	0	compute lambdaloss for a batch of queries and their documents. args: inputs: list of (queries, documents_list) labels: g	STRING
HIGH	…tence_transformers/cross_encoder/losses/lambda_loss.py	0	compute lambdaloss for a batch of queries and their documents. args: inputs: list of (queries, documents_list) labels: g	STRING
HIGH	tests/sentence_transformer/test_model_card.py	0	dummy dataset for testing purposes. the dataset looks as follows: { "anchor": ["anchor 1", "anchor 2", ..., "anchor 10"]	STRING
HIGH	tests/cross_encoder/test_model_card.py	0	dummy dataset for testing purposes. the dataset looks as follows: { "anchor": ["anchor 1", "anchor 2", ..., "anchor 10"]	STRING
HIGH	tests/sparse_encoder/test_model_card.py	0	dummy dataset for testing purposes. the dataset looks as follows: { "anchor": ["anchor 1", "anchor 2", ..., "anchor 10"]	STRING
HIGH	tests/sentence_transformer/test_compute_embeddings.py	0	predict() must run the forward pass via __call__ so that model.compile() applies to inference.	STRING
HIGH	tests/sparse_encoder/test_model.py	0	predict() must run the forward pass via __call__ so that model.compile() applies to inference.	STRING
HIGH	tests/cross_encoder/test_model.py	0	predict() must run the forward pass via __call__ so that model.compile() applies to inference.	STRING
HIGH	…ransformer/evaluation/test_label_accuracy_evaluator.py	0	tests the correct computation of evaluation scores from binaryclassificationevaluator	STRING
HIGH	…mer/evaluation/test_binary_classification_evaluator.py	0	tests the correct computation of evaluation scores from binaryclassificationevaluator	STRING
HIGH	…sformer/evaluation/test_paraphrase_mining_evaluator.py	0	tests the correct computation of evaluation scores from binaryclassificationevaluator	STRING
HIGH	tests/base/modules/transformer/test_text_generation.py	0	create a transformer instance and return it with its supported modalities.	STRING
HIGH	tests/base/modules/transformer/test_fill_mask.py	0	create a transformer instance and return it with its supported modalities.	STRING
54 more matches not shown…

Unused Imports560 hits · 527 pts

Severity	File	Line	Context
LOW	sentence_transformers/__init__.py	1	CODE
LOW	sentence_transformers/__init__.py	10	CODE
LOW	sentence_transformers/__init__.py	10	CODE
LOW	sentence_transformers/__init__.py	10	CODE
LOW	sentence_transformers/__init__.py	15	CODE
LOW	sentence_transformers/__init__.py	15	CODE
LOW	sentence_transformers/__init__.py	16	CODE
LOW	sentence_transformers/__init__.py	16	CODE
LOW	sentence_transformers/__init__.py	16	CODE
LOW	sentence_transformers/__init__.py	16	CODE
LOW	sentence_transformers/__init__.py	22	CODE
LOW	sentence_transformers/__init__.py	22	CODE
LOW	sentence_transformers/__init__.py	23	CODE
LOW	sentence_transformers/__init__.py	24	CODE
LOW	sentence_transformers/__init__.py	25	CODE
LOW	sentence_transformers/__init__.py	26	CODE
LOW	sentence_transformers/__init__.py	27	CODE
LOW	sentence_transformers/__init__.py	28	CODE
LOW	sentence_transformers/__init__.py	28	CODE
LOW	sentence_transformers/__init__.py	28	CODE
LOW	sentence_transformers/__init__.py	28	CODE
LOW	sentence_transformers/__init__.py	34	CODE
LOW	sentence_transformers/__init__.py	36	CODE
LOW	sentence_transformers/__init__.py	37	CODE
LOW	sentence_transformers/__init__.py	38	CODE
LOW	…entence_transformer/deprecated_model_card_templates.py	6	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	1	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	3	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	4	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	5	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	6	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	7	CODE
LOW	…nce_transformers/sentence_transformer/training_args.py	1	CODE
LOW	…nce_transformers/sentence_transformer/training_args.py	5	CODE
LOW	…nce_transformers/sentence_transformer/training_args.py	5	CODE
LOW	sentence_transformers/sentence_transformer/model.py	1	CODE
LOW	…nce_transformers/sentence_transformer/data_collator.py	1	CODE
LOW	sentence_transformers/sentence_transformer/trainer.py	1	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	1	CODE
LOW	…ntence_transformers/sentence_transformer/model_card.py	1	CODE
LOW	…sentence_transformer/losses/batch_semi_hard_triplet.py	1	CODE
LOW	…/losses/cached_multiple_negatives_symmetric_ranking.py	1	CODE
LOW	…ntence_transformers/sentence_transformer/losses/mse.py	1	CODE
LOW	…tence_transformer/losses/multiple_negatives_ranking.py	1	CODE
LOW	…transformers/sentence_transformer/losses/gist_embed.py	1	CODE
LOW	…rmers/sentence_transformer/losses/batch_all_triplet.py	1	CODE
LOW	…ransformers/sentence_transformer/losses/contrastive.py	1	CODE
LOW	…ence_transformers/sentence_transformer/losses/angle.py	1	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	1	CODE
LOW	…transformer/losses/global_orthogonal_regularization.py	1	CODE
LOW	…mers/sentence_transformer/losses/batch_hard_triplet.py	1	CODE
LOW	…transformers/sentence_transformer/losses/margin_mse.py	1	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	2	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	4	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	6	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	7	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	8	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	9	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	10	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	10	CODE
500 more matches not shown…

Self-Referential Comments94 hits · 285 pts

Severity	File	Line	Snippet	Context
MEDIUM	sentence_transformers/sentence_transformer/model.py	861	# Create a pool if not provided, but a list of devices is	COMMENT
MEDIUM	…transformers/sentence_transformer/losses/gist_embed.py	193	# Define the anchor threshold	COMMENT
MEDIUM	…rmers/sentence_transformer/losses/cached_gist_embed.py	351	# Define the anchor threshold	COMMENT
MEDIUM	…s/sentence_transformer/evaluation/paraphrase_mining.py	66	# Create a mapping from qid to question & a list of duplicates (qid1, qid2)	STRING
MEDIUM	sentence_transformers/util/retrieval.py	163	"""This function is deprecated. Use semantic_search instead"""	STRING
MEDIUM	sentence_transformers/cross_encoder/model.py	301	# Create a pool if is not provided, but a list of devices is	COMMENT
MEDIUM	…ers/cross_encoder/losses/multiple_negatives_ranking.py	137	# Create a mask for each anchor to each candidate index, where the matching positive	COMMENT
MEDIUM	sentence_transformers/cross_encoder/losses/plist_mle.py	224	# Create a mask for valid entries	COMMENT
MEDIUM	…ransformers/cross_encoder/evaluation/classification.py	51	# Create a list of pairs, and map the labels to the labels that the model knows	STRING
MEDIUM	sentence_transformers/sparse_encoder/model.py	728	# Create a pool if not provided, but a list of devices is	COMMENT
MEDIUM	sentence_transformers/base/sampler.py	533	# Create a random numpy permutation using int32 (or int64 if necessary)	COMMENT
MEDIUM	sentence_transformers/base/modules/transformer.py	2177	# This method is only called if this model has a modules.json, i.e. it's already been saved	COMMENT
MEDIUM	sentence_transformers/base/modules/router.py	81	# Create an asymmetric model with different encoders for queries and documents	COMMENT
MEDIUM⚡	tests/sentence_transformer/test_model.py	1201	# Create a simple dataset with a text column	COMMENT
MEDIUM⚡	tests/sentence_transformer/test_model.py	1322	# Create a Router with mixed modules	COMMENT
MEDIUM	tests/sentence_transformer/test_model.py	986	# Create a mock model with required prompts	COMMENT
MEDIUM	tests/sentence_transformer/test_model.py	1043	# Create a mock model with required prompts	COMMENT
MEDIUM	tests/sentence_transformer/test_multi_process.py	89	# Create a pool	COMMENT
MEDIUM	tests/sentence_transformer/test_multi_process.py	163	# Create a pool	COMMENT
MEDIUM	tests/sentence_transformer/test_trainer.py	197	# Create a new model card if a Trainer was initialized	COMMENT
MEDIUM	tests/sentence_transformer/test_trainer.py	801	# Define a custom batch sampler function	COMMENT
MEDIUM	tests/sentence_transformer/test_trainer.py	931	# Define a custom multi-dataset batch sampler function	COMMENT
MEDIUM	tests/util/test_hard_negatives.py	1030	# Create a larger dataset with 32 entries	COMMENT
MEDIUM	tests/util/test_hard_negatives.py	1145	# Create a dataset with just 2 pairs	COMMENT
MEDIUM⚡	tests/cross_encoder/test_model.py	753	# Create a simple dataset with a text column	COMMENT
MEDIUM	tests/cross_encoder/test_trainer.py	101	# Create a new model card if a Trainer was initialized	COMMENT
MEDIUM⚡	tests/sparse_encoder/test_model.py	426	# Create a simple dataset with a text column	COMMENT
MEDIUM	tests/sparse_encoder/test_model.py	136	# Create an empty sparse tensor	COMMENT
MEDIUM	tests/sparse_encoder/test_model.py	171	# Create a batch where the first sample has values but the second is all zeros	COMMENT
MEDIUM	tests/sparse_encoder/test_multi_process.py	62	# Create a pool	COMMENT
MEDIUM	tests/sparse_encoder/modules/test_csr.py	11	# Create a wrapper to measure outputs of the forward method	COMMENT
MEDIUM	tests/sparse_encoder/modules/test_csr.py	61	# Create the wrapper and replace the forward method	COMMENT
MEDIUM	tests/base/samplers/test_no_duplicates_batch_sampler.py	39	# Create a list of two 0's, two 1's, two 2's, ... two 49's. Then shuffle.	COMMENT
MEDIUM⚡	tests/base/modules/test_router.py	683	# Create a Router with different module configurations for each route	COMMENT
MEDIUM⚡	tests/base/modules/test_router.py	693	# Create a SentenceTransformer with static_embedding followed by router	COMMENT
MEDIUM	tests/base/modules/test_router.py	67	# Create a custom ModuleDict subclass to track access	COMMENT
MEDIUM	tests/base/modules/test_router.py	209	# Create a Router with StaticEmbedding modules	COMMENT
MEDIUM	tests/base/modules/test_router.py	313	# Create a Router with StaticEmbedding modules	COMMENT
MEDIUM	tests/base/modules/test_router.py	439	# Create a Router with StaticEmbedding modules	COMMENT
MEDIUM	tests/base/modules/test_router.py	457	# Create a loss function that works with router	COMMENT
MEDIUM	tests/base/modules/test_router.py	481	# Create a Router with StaticEmbedding modules	COMMENT
MEDIUM	tests/base/modules/test_router.py	493	# Create a loss function that works with router	COMMENT
MEDIUM	tests/base/modules/test_router.py	1356	# Create an object that will fail modality inference	COMMENT
MEDIUM	tests/base/modules/test_dense.py	92	# Create a Dense layer with custom keys	COMMENT
MEDIUM	…raining/data_augmentation/train_sts_qqp_crossdomain.py	182	# Define the training arguments	COMMENT
MEDIUM	…/training/data_augmentation/train_sts_indomain_bm25.py	216	# Define the training arguments	COMMENT
MEDIUM	…/training/data_augmentation/train_sts_indomain_bm25.py	237	# Create the trainer & start training	COMMENT
MEDIUM	…raining/data_augmentation/train_sts_indomain_nlpaug.py	137	# Define the training arguments	COMMENT
MEDIUM	…raining/data_augmentation/train_sts_indomain_nlpaug.py	158	# Create the trainer & start training	COMMENT
MEDIUM	…ining/data_augmentation/train_sts_indomain_semantic.py	226	# Define the training arguments	COMMENT
MEDIUM	…transformer/training/other/training_batch_hard_trec.py	45	# Create a dev set from train set	COMMENT
MEDIUM	…ansformer/training/other/training_gooaq_infonce_gor.py	42	# Define a custom loss that combines InfoNCE and Global Orthogonal Regularization	COMMENT
MEDIUM	…ransformer/training/distillation/model_distillation.py	111	# Create a relatively small dataset for evaluation	COMMENT
MEDIUM	…ransformer/training/distillation/model_distillation.py	116	# Create an STSB evaluator	COMMENT
MEDIUM	…ransformer/training/distillation/model_distillation.py	165	# Define the training arguments	COMMENT
MEDIUM	…ransformer/training/distillation/model_distillation.py	189	# Create the trainer & start training	COMMENT
MEDIUM	…ing/distillation/model_distillation_layer_reduction.py	58	# Create a smaller student model by using only some of the teacher layers	COMMENT
MEDIUM	…ing/distillation/model_distillation_layer_reduction.py	134	# Create a relatively small dataset for evaluation	COMMENT
MEDIUM	…ing/distillation/model_distillation_layer_reduction.py	169	# Create an STSB evaluator	COMMENT
MEDIUM	…ing/distillation/model_distillation_layer_reduction.py	188	# Define the training arguments	COMMENT
34 more matches not shown…

Excessive Try-Catch Wrapping178 hits · 180 pts

Severity	File	Line	Snippet	Context
LOW	…entence_transformer/deprecated_model_card_templates.py	189	except Exception as e:	CODE
MEDIUM	…entence_transformer/deprecated_model_card_templates.py	160	def get_train_objective_info(dataloader, loss):	CODE
LOW	…nsformers/sentence_transformer/evaluation/nano_beir.py	477	except Exception as e:	CODE
LOW	sentence_transformers/util/hard_negatives.py	463	except Exception:	CODE
LOW	sentence_transformers/util/logging.py	19	except Exception:	CODE
MEDIUM	sentence_transformers/util/logging.py	12	def emit(self, record) -> None:	CODE
LOW	sentence_transformers/util/misc.py	69	except Exception:	CODE
LOW	sentence_transformers/util/file_io.py	192	except Exception as first_error:	CODE
LOW	sentence_transformers/util/file_io.py	249	except Exception:	CODE
LOW	sentence_transformers/cross_encoder/model.py	387	except Exception as e:	CODE
LOW	sentence_transformers/cross_encoder/model.py	391	except Exception:	CODE
LOW	…nce_transformers/cross_encoder/evaluation/nano_beir.py	398	except Exception as e:	CODE
LOW	sentence_transformers/sparse_encoder/model.py	793	except Exception as e:	CODE
LOW	sentence_transformers/sparse_encoder/model.py	797	except Exception:	CODE
LOW	sentence_transformers/base/model.py	778	except Exception:	CODE
LOW	sentence_transformers/base/model.py	944	except Exception as exc:	CODE
LOW	sentence_transformers/base/model.py	1117	except Exception as exc:	CODE
LOW	sentence_transformers/base/model.py	1127	except Exception:	CODE
LOW	sentence_transformers/base/trainer.py	685	except Exception:	CODE
LOW	sentence_transformers/base/trainer.py	690	except Exception as exc:	CODE
MEDIUM	sentence_transformers/base/model_card.py	1424	def try_to_float(metric_value):	CODE
LOW⚡	sentence_transformers/base/model_card.py	1318	except Exception:	CODE
LOW⚡	sentence_transformers/base/model_card.py	1937	except Exception as exc:	CODE
LOW⚡	sentence_transformers/base/model_card.py	1945	except Exception as exc:	CODE
LOW⚡	sentence_transformers/base/model_card.py	1953	except Exception as exc:	CODE
LOW⚡	sentence_transformers/base/model_card.py	1960	except Exception as exc:	CODE
LOW	sentence_transformers/base/model_card.py	456	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	851	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	894	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1387	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1427	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1557	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1565	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1587	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1842	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1894	except Exception:	CODE
LOW	sentence_transformers/base/model_card.py	1905	except Exception as exc:	CODE
LOW	sentence_transformers/base/model_card.py	1911	except Exception as exc:	CODE
LOW	sentence_transformers/base/model_card.py	1922	except Exception as exc:	CODE
LOW	sentence_transformers/base/model_card.py	495	except Exception:	STRING
LOW	sentence_transformers/base/modules/transformer.py	1511	except Exception as exc:	CODE
LOW	sentence_transformers/base/modules/transformer.py	1549	except Exception as exc: # never let suffix derivation break tokenization	CODE
LOW	sentence_transformers/base/modules/transformer.py	1972	except Exception:	CODE
LOW	tests/cross_encoder/test_model_card.py	290	except Exception:	CODE
LOW	tests/sparse_encoder/test_pretrained.py	142	except Exception as e:	CODE
LOW	tests/sparse_encoder/test_pretrained.py	153	except Exception as e:	CODE
LOW	tests/sparse_encoder/test_pretrained.py	183	except Exception as e:	CODE
LOW	tests/sparse_encoder/test_trainer.py	127	except Exception as e:	CODE
LOW	tests/base/test_model_card.py	281	except Exception:	CODE
LOW⚡	tests/base/test_modality.py	694	except Exception as exc:	CODE
LOW	tests/base/modules/transformer/test_text_generation.py	87	except Exception as e:	CODE
LOW	tests/base/modules/transformer/conftest.py	197	except Exception as e:	CODE
LOW	tests/base/modules/transformer/conftest.py	722	except Exception:	CODE
LOW	tests/base/modules/transformer/test_fill_mask.py	78	except Exception as e:	CODE
LOW	…ts/base/modules/transformer/test_feature_extraction.py	89	except Exception as e:	CODE
LOW	tests/base/modules/transformer/test_any_to_any.py	95	except Exception as e:	CODE
LOW	…se/modules/transformer/test_sequence_classification.py	87	except Exception as e:	CODE
LOW	docs/cross_encoder/training_overview.md	865	except Exception:	CODE
LOW	docs/cross_encoder/training_overview.md	1069	except Exception:	CODE
LOW	…ples/sentence_transformer/training/nli/training_nli.py	117	except Exception:	CODE
118 more matches not shown…

Modern Structural Boilerplate111 hits · 109 pts

Severity	File	Line	Snippet	Context
LOW	sentence_transformers/__init__.py	51	__all__ = [	CODE
LOW	sentence_transformers/sentence_transformer/__init__.py	9	__all__ = [	CODE
LOW	…nce_transformers/sentence_transformer/training_args.py	53	__all__ = ["SentenceTransformerTrainingArguments", "BatchSamplers", "MultiDatasetBatchSamplers"]	STRING
LOW	sentence_transformers/sentence_transformer/model.py	931	def set_pooling_include_prompt(self, include_prompt: bool) -> None:	CODE
LOW	sentence_transformers/sentence_transformer/trainer.py	33	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	33	logger = logging.getLogger(__name__)	CODE
LOW	…tence_transformer/losses/multiple_negatives_ranking.py	14	logger = logging.getLogger(__name__)	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…e_transformers/sentence_transformer/losses/__init__.py	38	__all__ = [	CODE
LOW	…transformers/sentence_transformer/losses/matryoshka.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…transformers/sentence_transformer/losses/matryoshka.py	50	def set_dim(self, dim) -> None:	CODE
LOW	…/sentence_transformer/losses/denoising_auto_encoder.py	13	logger = logging.getLogger(__name__)	CODE
LOW	…sformers/sentence_transformer/losses/adaptive_layer.py	37	def set_layer_idx(self, layer_idx) -> None:	CODE
LOW	…ce_transformers/sentence_transformer/losses/softmax.py	15	logger = logging.getLogger(__name__)	CODE
LOW	…transformers/sentence_transformer/datasets/__init__.py	17	__all__ = [	CODE
LOW	…ormers/sentence_transformer/datasets/sentence_label.py	20	logger = logging.getLogger(__name__)	CODE
LOW	…rs/sentence_transformer/datasets/parallel_sentences.py	22	logger = logging.getLogger(__name__)	CODE
LOW	…ce_transformers/sentence_transformer/evaluation/mse.py	15	logger = logging.getLogger(__name__)	CODE
LOW	…ntence_transformer/evaluation/information_retrieval.py	21	logger = logging.getLogger(__name__)	CODE
LOW	…/sentence_transformer/evaluation/mse_from_dataframe.py	17	logger = logging.getLogger(__name__)	CODE
LOW	…ansformers/sentence_transformer/evaluation/__init__.py	20	__all__ = [	CODE
LOW	…ntence_transformer/evaluation/binary_classification.py	24	logger = logging.getLogger(__name__)	CODE
LOW	…nsformers/sentence_transformer/evaluation/reranking.py	23	logger = logging.getLogger(__name__)	CODE
LOW	…formers/sentence_transformer/evaluation/translation.py	19	logger = logging.getLogger(__name__)	CODE
LOW	…ransformers/sentence_transformer/evaluation/triplet.py	23	logger = logging.getLogger(__name__)	CODE
LOW	…s/sentence_transformer/evaluation/paraphrase_mining.py	15	logger = logging.getLogger(__name__)	CODE
LOW	…mers/sentence_transformer/evaluation/label_accuracy.py	17	logger = logging.getLogger(__name__)	CODE
LOW	…nsformers/sentence_transformer/evaluation/nano_beir.py	22	logger = logging.getLogger(__name__)	CODE
LOW	…entence_transformer/evaluation/embedding_similarity.py	24	logger = logging.getLogger(__name__)	CODE
LOW	…_transformers/sentence_transformer/readers/__init__.py	18	__all__ = [	CODE
LOW	…_transformers/sentence_transformer/modules/__init__.py	23	__all__ = [	CODE
LOW	…ransformers/sentence_transformer/modules/clip_model.py	8	logger = logging.getLogger(__name__)	CODE
LOW	…tence_transformers/sentence_transformer/modules/bow.py	13	logger = logging.getLogger(__name__)	CODE
LOW	…ormers/sentence_transformer/modules/word_embeddings.py	24	logger = logging.getLogger(__name__)	CODE
LOW	…rmers/sentence_transformer/modules/static_embedding.py	25	logger = logging.getLogger(__name__)	CODE
LOW	…nsformers/sentence_transformer/modules/word_weights.py	10	logger = logging.getLogger(__name__)	CODE
LOW	…ers/sentence_transformer/modules/tokenizer/__init__.py	7	__all__ = [	CODE
LOW	…rmers/sentence_transformer/modules/tokenizer/phrase.py	14	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/util/hard_negatives.py	16	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/util/quantization.py	10	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/util/__init__.py	58	__all__ = [	CODE
LOW	sentence_transformers/util/retrieval.py	17	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/util/environment.py	13	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/util/file_io.py	11	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/backend/optimize.py	8	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/backend/quantize.py	9	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/backend/__init__.py	8	__all__ = [	CODE
LOW	sentence_transformers/backend/utils.py	18	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/backend/load.py	11	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/cross_encoder/__init__.py	9	__all__ = [	CODE
LOW	sentence_transformers/cross_encoder/trainer.py	31	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	36	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/cross_encoder/model_card.py	27	logger = logging.getLogger(__name__)	CODE
LOW	sentence_transformers/cross_encoder/model_card.py	100	def set_widget_examples(self, dataset: Dataset \| DatasetDict) -> None:	CODE
LOW	sentence_transformers/cross_encoder/losses/__init__.py	23	__all__ = [	CODE
LOW	…ransformers/cross_encoder/evaluation/classification.py	17	logger = logging.getLogger(__name__)	CODE
LOW	…ence_transformers/cross_encoder/evaluation/__init__.py	16	__all__ = [	CODE
LOW	…nce_transformers/cross_encoder/evaluation/reranking.py	16	logger = logging.getLogger(__name__)	CODE
LOW	…e_transformers/cross_encoder/evaluation/correlation.py	16	logger = logging.getLogger(__name__)	CODE
LOW	…nce_transformers/cross_encoder/evaluation/nano_beir.py	18	logger = logging.getLogger(__name__)	CODE
51 more matches not shown…

Docstring Block Structure19 hits · 95 pts

Severity	File	Line	Snippet	Context
HIGH	sentence_transformers/sentence_transformer/trainer.py	37	SentenceTransformerTrainer is a simple but feature-complete training and eval loop for PyTorch based on the 🤗 T	STRING
HIGH	…rmers/sentence_transformer/modules/static_embedding.py	182	Creates a StaticEmbedding instance from a distillation process using the `model2vec` package. Args:	STRING
HIGH	…rmers/sentence_transformer/modules/static_embedding.py	245	Create a StaticEmbedding instance from a model2vec model. This method loads a pre-trained model2vec model	STRING
HIGH	sentence_transformers/util/misc.py	41	Import a dotted module path and return the attribute/class designated by the last name in the path. Raise Impor	STRING
HIGH	sentence_transformers/util/similarity.py	264	Converts a similarity function name or enum value to the corresponding similarity function. Args:	STRING
HIGH	sentence_transformers/util/similarity.py	302	Converts a similarity function into a pairwise similarity function. The pairwise similarity function r	STRING
HIGH	sentence_transformers/util/quantization.py	31	Performs semantic search using the FAISS library. Rescoring will be performed if: 1. `rescore` is True	STRING
HIGH	sentence_transformers/util/quantization.py	198	Performs semantic search using the usearch library. Rescoring will be performed if: 1. `rescore` is True	STRING
HIGH	sentence_transformers/util/tensor.py	152	Returns a new tensor with only the top-k values (in absolute terms) of each embedding, all others set to zero.	STRING
HIGH	sentence_transformers/util/file_io.py	41	Checks if the given model name or path corresponds to a SentenceTransformer model. Args: model_name_or	STRING
HIGH	sentence_transformers/util/file_io.py	82	Loads a file from a local or remote location. Args: model_name_or_path (str): The model name or path.	STRING
HIGH	sentence_transformers/util/file_io.py	141	Loads the subfolder path for a given model name or path. Args: model_name_or_path (str): The name or p	STRING
HIGH	sentence_transformers/util/file_io.py	206	Download a URL to a local file with a progress bar. The content is streamed in chunks and first written to a tempor	STRING
HIGH	sentence_transformers/backend/optimize.py	27	Export an optimized ONNX model from a SentenceTransformer, SparseEncoder, or CrossEncoder model. The O1-O4 opt	STRING
HIGH	sentence_transformers/backend/quantize.py	32	Export a quantized ONNX model from a SentenceTransformer, SparseEncoder, or CrossEncoder model. This function	STRING
HIGH	sentence_transformers/backend/quantize.py	119	Export a quantized OpenVINO model from a SentenceTransformer, SparseEncoder, or CrossEncoder model. This funct	STRING
HIGH	sentence_transformers/cross_encoder/model.py	560	Performs predictions with the CrossEncoder on the given input pairs. .. tip:: Adjusting `	STRING
HIGH	sentence_transformers/base/modality.py	556	Infer the modality of a single input sample by inspecting its type/structure. Pure type-based detection, does not r	STRING
HIGH	sentence_transformers/base/modules/module.py	311	A utility function to load the PyTorch weights of a model from a checkpoint. The checkpoint can be either a	STRING

Deep Nesting91 hits · 82 pts

Severity	File	Line	Context
LOW	sentence_transformers/sentence_transformer/model.py	485	CODE
LOW	sentence_transformers/sentence_transformer/model.py	843	CODE
LOW	sentence_transformers/sentence_transformer/model.py	908	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	63	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	407	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	465	CODE
LOW	…tence_transformer/losses/multiple_negatives_ranking.py	237	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	83	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	171	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	445	CODE
LOW	…transformers/sentence_transformer/losses/matryoshka.py	92	CODE
LOW	…rmers/sentence_transformer/losses/cached_gist_embed.py	64	CODE
LOW	…tence_transformer/datasets/no_duplicates_dataloader.py	29	CODE
LOW	…ntence_transformer/evaluation/information_retrieval.py	299	CODE
LOW	…ntence_transformer/evaluation/information_retrieval.py	447	CODE
LOW	…formers/sentence_transformer/evaluation/translation.py	102	CODE
LOW	…s/sentence_transformer/evaluation/paraphrase_mining.py	245	CODE
LOW	…e_transformers/sentence_transformer/modules/pooling.py	163	CODE
LOW	…e_transformers/sentence_transformer/modules/pooling.py	237	CODE
LOW	…rmers/sentence_transformer/modules/tokenizer/phrase.py	63	CODE
LOW	sentence_transformers/util/hard_negatives.py	25	CODE
LOW	sentence_transformers/util/deprecated_import.py	210	CODE
LOW	sentence_transformers/util/quantization.py	18	CODE
LOW	sentence_transformers/util/quantization.py	185	CODE
LOW	sentence_transformers/util/quantization.py	371	CODE
LOW	sentence_transformers/util/retrieval.py	89	CODE
LOW	sentence_transformers/util/retrieval.py	167	CODE
LOW	sentence_transformers/util/retrieval.py	258	CODE
LOW	sentence_transformers/util/environment.py	51	CODE
LOW	sentence_transformers/util/file_io.py	205	CODE
LOW	sentence_transformers/util/decorators.py	92	CODE
LOW	sentence_transformers/util/decorators.py	115	CODE
LOW	sentence_transformers/backend/utils.py	155	CODE
LOW	sentence_transformers/cross_encoder/model.py	283	CODE
LOW	sentence_transformers/cross_encoder/model.py	357	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	61	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	417	CODE
LOW	sentence_transformers/sparse_encoder/model.py	711	CODE
LOW	…rs/sparse_encoder/evaluation/reciprocal_rank_fusion.py	105	CODE
LOW	…_transformers/sparse_encoder/modules/splade_pooling.py	65	CODE
LOW	sentence_transformers/base/model.py	854	CODE
LOW	sentence_transformers/base/model.py	1052	CODE
LOW	sentence_transformers/base/modality.py	232	CODE
LOW	sentence_transformers/base/modality.py	446	CODE
LOW	sentence_transformers/base/modality.py	470	CODE
LOW	sentence_transformers/base/data_collator.py	114	CODE
LOW	sentence_transformers/base/trainer.py	392	CODE
LOW	sentence_transformers/base/trainer.py	417	CODE
LOW	sentence_transformers/base/trainer.py	532	CODE
LOW	sentence_transformers/base/trainer.py	815	CODE
LOW	sentence_transformers/base/trainer.py	1206	CODE
LOW	sentence_transformers/base/model_card.py	429	CODE
LOW	sentence_transformers/base/model_card.py	512	CODE
LOW	sentence_transformers/base/model_card.py	594	CODE
LOW	sentence_transformers/base/model_card.py	696	CODE
LOW	sentence_transformers/base/model_card.py	792	CODE
LOW	sentence_transformers/base/model_card.py	1361	CODE
LOW	sentence_transformers/base/model_card.py	1889	CODE
LOW	sentence_transformers/base/modules/transformer.py	456	CODE
LOW	sentence_transformers/base/modules/transformer.py	589	CODE
31 more matches not shown…

AI Structural Patterns63 hits · 60 pts

Severity	File	Line	Context
LOW	sentence_transformers/sentence_transformer/model.py	149	CODE
LOW	sentence_transformers/sentence_transformer/model.py	222	CODE
LOW	sentence_transformers/sentence_transformer/model.py	281	CODE
LOW	sentence_transformers/sentence_transformer/model.py	344	CODE
LOW	sentence_transformers/sentence_transformer/model.py	364	CODE
LOW	sentence_transformers/sentence_transformer/model.py	384	CODE
LOW	sentence_transformers/sentence_transformer/model.py	404	CODE
LOW	sentence_transformers/sentence_transformer/model.py	424	CODE
LOW	sentence_transformers/sentence_transformer/model.py	444	CODE
LOW	sentence_transformers/sentence_transformer/model.py	464	CODE
LOW	sentence_transformers/sentence_transformer/model.py	485	CODE
LOW	sentence_transformers/sentence_transformer/model.py	809	CODE
LOW	sentence_transformers/sentence_transformer/trainer.py	118	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	161	CODE
LOW	sentence_transformers/sentence_transformer/fit_mixin.py	465	CODE
LOW	…ransformer/losses/cached_multiple_negatives_ranking.py	199	CODE
LOW	…rmers/sentence_transformer/losses/cached_gist_embed.py	92	CODE
LOW	…ntence_transformer/evaluation/information_retrieval.py	131	CODE
LOW	…nsformers/sentence_transformer/evaluation/reranking.py	90	CODE
LOW	…ransformers/sentence_transformer/evaluation/triplet.py	84	CODE
LOW	…s/sentence_transformer/evaluation/paraphrase_mining.py	92	CODE
LOW	…nsformers/sentence_transformer/evaluation/nano_beir.py	214	CODE
LOW	…entence_transformer/evaluation/embedding_similarity.py	82	CODE
LOW	…_transformers/sentence_transformer/readers/sts_data.py	26	CODE
LOW	…_transformers/sentence_transformer/readers/sts_data.py	78	CODE
LOW	…ormers/sentence_transformer/modules/word_embeddings.py	91	CODE
LOW	…rmers/sentence_transformer/modules/static_embedding.py	169	CODE
LOW	sentence_transformers/util/hard_negatives.py	25	CODE
LOW	sentence_transformers/util/quantization.py	18	CODE
LOW	sentence_transformers/util/quantization.py	185	CODE
LOW	sentence_transformers/util/retrieval.py	23	CODE
LOW	sentence_transformers/cross_encoder/model.py	150	CODE
LOW	sentence_transformers/cross_encoder/model.py	471	CODE
LOW	sentence_transformers/cross_encoder/model.py	489	CODE
LOW	sentence_transformers/cross_encoder/model.py	507	CODE
LOW	sentence_transformers/cross_encoder/model.py	525	CODE
LOW	sentence_transformers/cross_encoder/model.py	544	CODE
LOW	sentence_transformers/cross_encoder/model.py	721	CODE
LOW	sentence_transformers/cross_encoder/trainer.py	114	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	182	CODE
LOW	sentence_transformers/cross_encoder/fit_mixin.py	417	CODE
LOW	…nce_transformers/cross_encoder/evaluation/reranking.py	105	CODE
LOW	…nce_transformers/cross_encoder/evaluation/nano_beir.py	197	CODE
LOW	sentence_transformers/sparse_encoder/model.py	144	CODE
LOW	sentence_transformers/sparse_encoder/model.py	199	CODE
LOW	sentence_transformers/sparse_encoder/model.py	272	CODE
LOW	sentence_transformers/sparse_encoder/model.py	348	CODE
LOW	sentence_transformers/sparse_encoder/trainer.py	114	CODE
LOW	…ce_transformers/sparse_encoder/losses/cached_splade.py	26	CODE
LOW	…rse_encoder/evaluation/sparse_information_retrieval.py	136	CODE
LOW	…ansformers/sparse_encoder/evaluation/sparse_triplet.py	98	CODE
LOW	…sformers/sparse_encoder/evaluation/sparse_nano_beir.py	200	CODE
LOW	…sformers/sparse_encoder/evaluation/sparse_reranking.py	110	CODE
LOW	sentence_transformers/base/model.py	81	CODE
LOW	sentence_transformers/base/model.py	790	CODE
LOW	sentence_transformers/base/model.py	854	CODE
LOW	sentence_transformers/base/trainer.py	149	CODE
LOW	sentence_transformers/base/trainer.py	1181	CODE
LOW	sentence_transformers/base/model_card.py	1813	CODE
LOW	sentence_transformers/base/modules/transformer.py	589	CODE
3 more matches not shown…

Modern AI Meta-Vocabulary22 hits · 60 pts

Severity	File	Line	Snippet	Context
MEDIUM	sentence_transformers/base/modules/router.py	72	# Use a regular SentenceTransformer for the document embeddings, and a static embedding model for the qu	COMMENT
MEDIUM	docs/migration_guide.md	475	# and a static embedding model for the query embeddings	COMMENT
MEDIUM	docs/migration_guide.md	502	# and a static embedding model for the query embeddings	COMMENT
MEDIUM	docs/cross_encoder/training_overview.md	323	# Mine hard negatives using a very efficient embedding model	COMMENT
MEDIUM	docs/cross_encoder/training_overview.md	335	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	docs/cross_encoder/training_overview.md	606	# Mine hard negatives using a very efficient embedding model	COMMENT
MEDIUM	docs/cross_encoder/training_overview.md	613	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	docs/cross_encoder/training_overview.md	954	# 2b. Modify our training dataset to include hard negatives using a very efficient embedding model	COMMENT
MEDIUM	docs/cross_encoder/training_overview.md	964	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	docs/cross_encoder/training_overview.md	987	# embedding model as a baseline.	COMMENT
MEDIUM	…nsformer/applications/embedding-quantization/README.md	37	# 1. Load an embedding model	COMMENT
MEDIUM	…nsformer/applications/embedding-quantization/README.md	109	# 1. Load an embedding model	COMMENT
MEDIUM	…rmer/applications/parallel-sentence-mining/bucc2018.py	62	# Use PCA to reduce the dimensionality of the sentence embedding model	COMMENT
MEDIUM	…cross_encoder/training/rerankers/training_gooaq_bce.py	54	# 2b. Modify our training dataset to include hard negatives using a very efficient embedding model	COMMENT
MEDIUM	…cross_encoder/training/rerankers/training_gooaq_bce.py	64	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	…cross_encoder/training/rerankers/training_gooaq_bce.py	87	# embedding model as a baseline.	COMMENT
MEDIUM	…ss_encoder/training/rerankers/training_gooaq_lambda.py	54	# 2b. Modify our training dataset to include hard negatives using a very efficient embedding model	COMMENT
MEDIUM	…ss_encoder/training/rerankers/training_gooaq_lambda.py	64	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	…ss_encoder/training/rerankers/training_gooaq_lambda.py	86	# embedding model as a baseline.	COMMENT
MEDIUM	…es/cross_encoder/training/rerankers/training_nq_bce.py	54	# 2b. Modify our training dataset to include hard negatives using a very efficient embedding model	COMMENT
MEDIUM	…es/cross_encoder/training/rerankers/training_nq_bce.py	64	batch_size=4096, # Use a batch size of 4096 for the embedding model	CODE
MEDIUM	…es/cross_encoder/training/rerankers/training_nq_bce.py	87	# embedding model as a baseline.	COMMENT

Redundant / Tautological Comments42 hits · 55 pts

Severity	File	Line	Snippet	Context
LOW⚡	…mers/sentence_transformer/losses/batch_hard_triplet.py	240	# Check if labels[i] == labels[j]	COMMENT
LOW⚡	…mers/sentence_transformer/losses/batch_hard_triplet.py	254	# Check if labels[i] != labels[k]	COMMENT
LOW	sentence_transformers/util/tensor.py	23	# Check if list contains sparse tensors	COMMENT
LOW	sentence_transformers/util/retrieval.py	332	# Check if we need to increase sort_max_size	COMMENT
LOW	sentence_transformers/cross_encoder/model_card.py	211	# Check if any pair element is non-text (from usage_examples before asset saving)	COMMENT
LOW	…arse_encoder/evaluation/sparse_embedding_similarity.py	78	# Print the results	STRING
LOW	…rse_encoder/evaluation/sparse_information_retrieval.py	128	# Print the results	STRING
LOW	…ansformers/sparse_encoder/evaluation/sparse_triplet.py	90	# Print the results	STRING
LOW	…sformers/sparse_encoder/evaluation/sparse_nano_beir.py	171	# Print the results	STRING
LOW	…rse_encoder/evaluation/sparse_binary_classification.py	104	# Print the results	STRING
LOW	…e_transformers/sparse_encoder/evaluation/sparse_mse.py	78	# Print the results	STRING
LOW	…ormers/sparse_encoder/evaluation/sparse_translation.py	75	# Print the results	STRING
LOW	…sformers/sparse_encoder/evaluation/sparse_reranking.py	102	# Print the results	STRING
LOW	…mers/sparse_encoder/modules/sparse_static_embedding.py	202	# Check if we have a JSON path in config	COMMENT
LOW	sentence_transformers/base/model.py	995	# Check if this is a Sentence Transformer model	COMMENT
LOW	sentence_transformers/base/model.py	1081	# Check if the config_sentence_transformers.json file exists (exists since v2 of the framework)	COMMENT
LOW	sentence_transformers/base/model.py	1106	# Check if a readme exists. README is optional metadata; a transient Hub error	COMMENT
LOW	sentence_transformers/base/model.py	1160	# Check if the `load` method only accepts a single parameter (the path to the local directory).	COMMENT
LOW	sentence_transformers/base/model_card.py	651	# Check if the model has a tuple modality whose parts all match available columns.	STRING
LOW	tests/sparse_encoder/utils.py	26	# Check if shape matches	COMMENT
LOW	tests/sparse_encoder/utils.py	40	# Check if indices are the same	COMMENT
LOW	tests/sparse_encoder/utils.py	44	# Check if values are close	COMMENT
LOW	tests/sparse_encoder/test_trainer.py	116	# Check if model parameters have changed after training	COMMENT
LOW	…sparse_encoder/modules/test_sparse_static_embedding.py	55	# Check if embeddings are the same before and after save/load	COMMENT
LOW	…sparse_encoder/modules/test_sparse_static_embedding.py	58	# Check if SparseStaticEmbedding weights are maintained after loading	COMMENT
LOW	…modules/transformer/update_transformers_tiny_models.py	42	# Check if the model_id contains the architecture name	COMMENT
LOW	…er/training/quora_duplicate_questions/create_splits.py	510	####### Write files for Information Retrieval #####	COMMENT
LOW	…/embedding-quantization/semantic_search_recommended.py	120	# Output the results	COMMENT
LOW	…tions/semantic-search/semantic_search_quora_pytorch.py	34	# Check if embedding cache path exists	COMMENT
LOW	…cations/semantic-search/semantic_search_quora_annoy.py	55	# Check if embedding cache path exists	COMMENT
LOW	…tions/semantic-search/semantic_search_quora_hnswlib.py	45	# Check if embedding cache path exists	COMMENT
LOW	…cations/semantic-search/semantic_search_quora_faiss.py	59	# Check if embedding cache path exists	COMMENT
LOW	…/cross_encoder/applications/cross_encoder_reranking.py	41	# Check if embedding cache path exists	COMMENT
LOW	…parse_encoder/evaluation/sparse_reranking_evaluator.py	53	# Print the results	COMMENT
LOW	…coder/evaluation/sparse_nanobeir_advanced_evaluator.py	45	# Print the results	COMMENT
LOW	…arse_encoder/evaluation/sparse_similarity_evaluator.py	31	# Print the results	COMMENT
LOW	…rse_encoder/evaluation/sparse_translation_evaluator.py	30	# Print the results	COMMENT
LOW	…/sparse_encoder/evaluation/sparse_triplet_evaluator.py	37	# Print the results	COMMENT
LOW	…parse_encoder/evaluation/sparse_retrieval_evaluator.py	69	# Print the results	COMMENT
LOW	…sparse_encoder/evaluation/sparse_nanobeir_evaluator.py	94	# Print the results	COMMENT
LOW	…_encoder/evaluation/sparse_classification_evaluator.py	58	# Print the results	COMMENT
LOW	…ples/sparse_encoder/evaluation/sparse_mse_evaluator.py	32	# Print the results	COMMENT

AI Slop Vocabulary7 hits · 14 pts

Severity	File	Line	Snippet	Context
LOW	…transformers/sentence_transformer/losses/gist_embed.py	243	# so the label for anchor[i] is i. This means that we can just use arange	COMMENT
LOW	…rmers/sentence_transformer/losses/cached_gist_embed.py	334	# so the label for anchor[i] is i. This means that we can just use arange	COMMENT
LOW	sentence_transformers/base/trainer.py	514	# if loss is nan or inf simply add the average of previous logged losses	COMMENT
LOW	sentence_transformers/base/trainer.py	567	# would not accept it. If None, we just call the super().log() method without it so that it works with all versi	COMMENT
LOW	sentence_transformers/base/trainer.py	644	# If the evaluator is not defined, we can just return the output	COMMENT
MEDIUM	…applications/parallel-sentence-mining/bitext_mining.py	27	# Model we want to use for bitext mining. sentence-transformers/LaBSE achieves state-of-the-art performance	COMMENT
MEDIUM	…rmer/applications/parallel-sentence-mining/bucc2018.py	25	# Model we want to use for bitext mining. sentence-transformers/LaBSE achieves state-of-the-art performance	COMMENT

Verbosity Indicators9 hits · 14 pts

Severity	File	Line	Snippet	Context
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	102	# Step 1: Train cross-encoder model with STSbenchmark	COMMENT
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	141	# Step 2: Label QQP train dataset using cross-encoder (BERT) model	COMMENT
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	158	# Step 3: Train bi-encoder (SBERT) model with QQP dataset - Augmented SBERT	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	89	# Step 1: Train cross-encoder model with (gold) STS benchmark dataset	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	135	# Step 2: Label BM25 sampled STSb (silver dataset) using cross-encoder model	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	188	# Step 3: Train bi-encoder model with both (gold + silver) STSbenchmark dataset - Augmented SBERT	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	101	# Step 1: Train cross-encoder model with STSbenchmark	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	140	# Step 2: Find silver pairs to label	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	199	# Step 3: Train bi-encoder model with both STSbenchmark and labeled AllNlI - Augmented SBERT	COMMENT

Structural Annotation Overuse9 hits · 14 pts

Severity	File	Line	Snippet	Context
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	102	# Step 1: Train cross-encoder model with STSbenchmark	COMMENT
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	141	# Step 2: Label QQP train dataset using cross-encoder (BERT) model	COMMENT
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	158	# Step 3: Train bi-encoder (SBERT) model with QQP dataset - Augmented SBERT	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	89	# Step 1: Train cross-encoder model with (gold) STS benchmark dataset	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	135	# Step 2: Label BM25 sampled STSb (silver dataset) using cross-encoder model	COMMENT
LOW	…/training/data_augmentation/train_sts_indomain_bm25.py	188	# Step 3: Train bi-encoder model with both (gold + silver) STSbenchmark dataset - Augmented SBERT	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	101	# Step 1: Train cross-encoder model with STSbenchmark	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	140	# Step 2: Find silver pairs to label	COMMENT
LOW	…ining/data_augmentation/train_sts_indomain_semantic.py	199	# Step 3: Train bi-encoder model with both STSbenchmark and labeled AllNlI - Augmented SBERT	COMMENT

Over-Commented Block12 hits · 12 pts

Severity	File	Line	Snippet	Context
LOW	docs/conf.py	1	# Configuration file for the Sphinx documentation builder.	COMMENT
LOW	docs/sparse_encoder/training_overview.md	141	router = Router.for_query_document(	COMMENT
LOW	…raining/data_augmentation/train_sts_qqp_crossdomain.py	141	# Step 2: Label QQP train dataset using cross-encoder (BERT) model	COMMENT
LOW	…ransformer/training/distillation/model_quantization.py	61	},	COMMENT
LOW	…transformer/training/multilingual/make_multilingual.py	81	# If we want, we can limit the maximum sequence length for the model	COMMENT
LOW	…ations/embedding-quantization/semantic_search_faiss.py	61	# In the first call we'll provide the `corpus_embeddings` and get the `corpus_index` back, which	COMMENT
LOW	…ions/embedding-quantization/semantic_search_usearch.py	61	# In the first call we'll provide the `corpus_embeddings` and get the `corpus_index` back, which	COMMENT
LOW	…ncoder/training/ms_marco/training_ms_marco_plistmle.py	101	# lambda_weight = PListMLELambdaWeight(rank_discount_fn=custom_discount)	COMMENT
LOW	examples/cross_encoder/training/distillation/README.md	61	# {"corpus_id": 4, "score": 0.91639173},	COMMENT
LOW	.github/workflows/sync-skills.yml	1	name: Sync skill to huggingface/skills	COMMENT
LOW	…train_sentence_transformer_static_embedding_example.py	1	#!/usr/bin/env python3	COMMENT
LOW	…cripts/train_sentence_transformer_with_lora_example.py	1	#!/usr/bin/env python3	COMMENT

Slop Phrases2 hits · 6 pts

Severity	File	Line	Snippet	Context
MEDIUM	…te_questions/application_duplicate_questions_mining.py	10	# For demonstration purposes, we limit it to a few questions which all have on duplicate	COMMENT
MEDIUM	…ce_transformer/training/prompts/training_nq_prompts.py	26	# Feel free to adjust these variables:	COMMENT

Fake / Example Data1 hit · 2 pts

Severity	File	Line	Snippet	Context
LOW⚡	tests/base/test_model.py	381	SentenceTransformer(modules=[torch.nn.Linear(10, 10)], use_auth_token="fake-token")	CODE

Analysis Overview

What These Metrics Mean

Score History

Severity Breakdown

Directory Score Breakdown

Pattern Findings