Repository Analysis

openai/whisper

Robust Speech Recognition via Large-Scale Weak Supervision

4.1 Likely human-written View on GitHub
4.1
Adjusted Score
4.1
Raw Score
100%
Time Factor
2026-04-15
Last Push
100,922
Stars
Python
Language
7,048
Lines of Code
32
Files
29
Pattern Hits
2026-05-31
Scan Date

Score History

Severity Breakdown

CRITICAL 0HIGH 0MEDIUM 0LOW 29

Pattern Findings

29 matches across 4 categories. Click a row to expand file-level details.

Unused Imports13 hits · 13 pts
SeverityFileLineSnippet
LOWwhisper/__init__.py11
LOWwhisper/__init__.py11
LOWwhisper/__init__.py11
LOWwhisper/__init__.py12
LOWwhisper/__init__.py12
LOWwhisper/__init__.py12
LOWwhisper/__init__.py12
LOWwhisper/__init__.py14
LOWwhisper/decoding.py15
LOWwhisper/timing.py16
LOWwhisper/transcribe.py35
LOWwhisper/normalizers/__init__.py1
LOWwhisper/normalizers/__init__.py2
Deep Nesting10 hits · 10 pts
SeverityFileLineSnippet
LOWwhisper/__init__.py54
LOWwhisper/decoding.py323
LOWwhisper/decoding.py384
LOWwhisper/timing.py58
LOWwhisper/timing.py83
LOWwhisper/timing.py279
LOWwhisper/utils.py123
LOWwhisper/utils.py142
LOWwhisper/transcribe.py38
LOWwhisper/normalizers/english.py165
Hyper-Verbose Identifiers5 hits · 5 pts
SeverityFileLineSnippet
LOWtests/test_timing.py57def test_dtw_cuda_equivalence(N: int, M: int):
LOWtests/test_timing.py89def test_median_filter_equivalence(shape):
LOWtests/test_tokenizer.py14def test_multilingual_tokenizer():
LOWwhisper/tokenizer.py238 def sot_sequence_including_notimestamps(self) -> Tuple[int]:
LOWwhisper/normalizers/basic.py27def remove_symbols_and_diacritics(s: str, keep=""):
Excessive Try-Catch Wrapping1 hit · 1 pts
SeverityFileLineSnippet
LOWwhisper/transcribe.py617 except Exception as e: