Python tool for converting files and office documents to Markdown.
382 matches across 13 categories. Click a row to expand file-level details.
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 114 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 116 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 160 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 162 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 193 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 195 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 308 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 310 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 558 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 560 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 637 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 639 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 719 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 721 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 746 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 748 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 788 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 790 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 906 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 908 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 25 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 27 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 47 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 49 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 662 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown/tests/test_cu_converter.py | 664 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 50 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 52 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 335 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 337 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 436 | # --------------------------------------------------------------------------- |
| MEDIUM | …/markitdown/src/markitdown/converters/_cu_converter.py | 438 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 103 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 105 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 166 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 168 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 210 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 212 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 55 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 57 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 70 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 72 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 87 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 89 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 120 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 122 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 147 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_docx_converter.py | 149 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 61 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 63 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 75 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 77 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 91 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 93 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 106 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 108 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 121 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 123 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 135 | # --------------------------------------------------------------------------- |
| MEDIUM | packages/markitdown-ocr/tests/test_pptx_converter.py | 137 | # --------------------------------------------------------------------------- |
| 30 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | packages/markitdown/tests/test_module_vectors.py | 72 | def test_convert_stream_with_hints(test_vector): |
| LOW | packages/markitdown/tests/test_module_vectors.py | 93 | def test_convert_stream_without_hints(test_vector): |
| LOW | packages/markitdown/tests/test_module_vectors.py | 163 | def test_convert_keep_data_uris(test_vector): |
| LOW | packages/markitdown/tests/test_module_vectors.py | 181 | def test_convert_stream_keep_data_uris(test_vector): |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 17 | def test_partial_numbering_pattern_regex(self): |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 34 | def test_masterformat_partial_numbering_not_split(self): |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 73 | def test_masterformat_content_preserved(self): |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 115 | def test_merge_partial_numbering_with_empty_lines_between(self): |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 148 | def test_multiple_partial_numberings_all_merged(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 86 | def test_page_close_called_on_every_page(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 116 | def test_plain_text_pdf_falls_back_to_pdfminer(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 150 | def test_plain_text_pdf_still_closes_all_pages(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 177 | def test_mixed_pdf_uses_form_extraction_per_page(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 222 | def test_only_one_pdfplumber_open_call(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 249 | def test_real_pdf_page_cleanup(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 303 | def test_memory_does_not_grow_linearly(self): |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 333 | def test_memory_constant_across_page_counts(self): |
| LOW | packages/markitdown/tests/test_module_misc.py | 110 | def test_stream_info_operations() -> None: |
| LOW | packages/markitdown/tests/test_module_misc.py | 291 | def test_deeply_nested_html_fallback() -> None: |
| LOW | packages/markitdown/tests/test_module_misc.py | 404 | def test_speech_transcription() -> None: |
| LOW | packages/markitdown/tests/test_module_misc.py | 465 | def test_markitdown_llm_parameters() -> None: |
| LOW | packages/markitdown/tests/test_docintel_html.py | 15 | def test_docintel_accepts_html_extension(): |
| LOW | packages/markitdown/tests/test_docintel_html.py | 21 | def test_docintel_accepts_html_mimetype(): |
| LOW | packages/markitdown/tests/test_cli_vectors.py | 98 | def test_input_from_stdin_without_hints(shared_tmp_dir, test_vector) -> None: |
| LOW | packages/markitdown/tests/test_cli_vectors.py | 152 | def test_output_to_file_with_data_uris(shared_tmp_dir, test_vector) -> None: |
| LOW | packages/markitdown/tests/test_cu_converter.py | 109 | def test_rejects_unsupported_extensions(self, ext): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 155 | def test_rejects_unsupported_mimetypes(self, mime): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 168 | def test_restricted_to_pdf_only(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 186 | def test_webm_value_matches_cli_input(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 201 | def test_detects_video_from_mime_without_extension(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 207 | def test_detects_audio_from_mime_without_extension(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 213 | def test_detects_audio_alias_from_mime_without_extension(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 219 | def test_detects_video_alias_from_mime_without_extension(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 298 | def test_file_type_restriction_applies_to_mime(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 316 | def test_document_analyzer_routes_pdf_to_custom(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 566 | def test_known_document_prebuilts(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 578 | def test_known_audio_prebuilts(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 584 | def test_known_video_prebuilts(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 590 | def test_known_image_prebuilts(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 596 | def test_unknown_prebuilt_falls_back_to_get_analyzer(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 618 | def test_custom_analyzer_no_base_defaults_to_document(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 628 | def test_get_analyzer_failure_raises_value_error(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 702 | def test_wav_returns_markdown(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 712 | def test_jpeg_returns_markdown(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 727 | def test_nonexistent_analyzer_raises_value_error(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 754 | def test_cu_registered_before_docintel(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 796 | def test_use_cu_without_endpoint_exits(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 914 | def test_missing_deps_message(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 94 | def test_accepts_supported_extensions(self, ext): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 143 | def test_accepts_supported_mimetypes(self, mime): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 267 | def test_content_type_for_resolves_conflicts_to_file_type( |
| LOW | packages/markitdown/tests/test_cu_converter.py | 273 | def test_conflicting_extension_and_mimetype_in_convert(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 386 | def test_no_analyzer_id_uses_auto_routing(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 406 | def test_no_analyzer_id_routes_image_to_document_search(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 426 | def test_document_analyzer_routes_image_to_custom(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 449 | def test_image_analyzer_routes_jpeg_to_custom(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 472 | def test_image_analyzer_routes_pdf_to_document_prebuilt(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 504 | def test_mime_only_input_uses_auto_routing(self, mimetype, expected_analyzer): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 521 | def test_mime_alias_input_uses_canonical_content_type(self): |
| LOW | packages/markitdown/tests/test_cu_converter.py | 539 | def test_extension_only_input_uses_file_type_content_type(self): |
| 55 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | …sample-plugin/src/markitdown_sample_plugin/__init__.py | 5 | |
| LOW | …sample-plugin/src/markitdown_sample_plugin/__init__.py | 5 | |
| LOW | …sample-plugin/src/markitdown_sample_plugin/__init__.py | 5 | |
| LOW | …sample-plugin/src/markitdown_sample_plugin/__init__.py | 6 | |
| LOW | packages/markitdown/tests/test_pdf_memory.py | 26 | |
| LOW | packages/markitdown/tests/test_cu_converter.py | 13 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 5 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 6 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 6 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 6 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 11 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 11 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 12 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 13 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 13 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 13 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 13 | |
| LOW | packages/markitdown/src/markitdown/__init__.py | 13 | |
| LOW | …arkitdown/src/markitdown/converters/_xlsx_converter.py | 13 | |
| LOW | …arkitdown/src/markitdown/converters/_xlsx_converter.py | 20 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 5 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 6 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 7 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 8 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 9 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 10 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 11 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 12 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 13 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 14 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 14 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 15 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 16 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 17 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 18 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 19 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 20 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 20 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 24 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 24 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 28 | |
| LOW | …kages/markitdown/src/markitdown/converters/__init__.py | 29 | |
| LOW | …arkitdown/src/markitdown/converters/_docx_converter.py | 2 | |
| LOW | …arkitdown/src/markitdown/converters/_docx_converter.py | 3 | |
| LOW | …arkitdown/src/markitdown/converters/_pptx_converter.py | 9 | |
| LOW | …markitdown/src/markitdown/converters/_zip_converter.py | 13 | |
| LOW | …own/src/markitdown/converters/_plain_text_converter.py | 12 | |
| LOW | …src/markitdown/converter_utils/docx/math/latex_dict.py | 8 | |
| LOW | packages/markitdown-mcp/src/markitdown_mcp/__init__.py | 5 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 10 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 10 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 11 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 12 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 12 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 16 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 17 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 18 | |
| LOW | packages/markitdown-ocr/src/markitdown_ocr/__init__.py | 19 |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 79 | except Exception: |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 268 | except Exception: |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 630 | except Exception: |
| LOW | …rkitdown/src/markitdown/converters/_image_converter.py | 112 | except Exception as e: |
| LOW | …s/markitdown/src/markitdown/converters/_llm_caption.py | 24 | except Exception as e: |
| LOW | …wn/src/markitdown/converters/_outlook_msg_converter.py | 66 | except Exception as e: |
| LOW | …wn/src/markitdown/converters/_outlook_msg_converter.py | 147 | except Exception: |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 576 | except Exception: |
| LOW | …/markitdown/src/markitdown/converters/_cu_converter.py | 421 | except Exception as exc: |
| LOW | …itdown/src/markitdown/converters/_youtube_converter.py | 114 | except Exception as e: |
| MEDIUM | …itdown/src/markitdown/converters/_youtube_converter.py | 115 | print(f"Error extracting description: {e}") |
| LOW | …itdown/src/markitdown/converters/_youtube_converter.py | 176 | except Exception as e: |
| MEDIUM | …itdown/src/markitdown/converters/_youtube_converter.py | 179 | print(f"Error fetching transcript: {e}") |
| LOW | …itdown/src/markitdown/converters/_youtube_converter.py | 232 | except Exception as e: |
| LOW | …rkitdown/src/markitdown/converters/_ipynb_converter.py | 93 | except Exception as e: |
| LOW | …arkitdown/src/markitdown/converters/_pptx_converter.py | 127 | except Exception: |
| LOW | …arkitdown/src/markitdown/converters/_pptx_converter.py | 134 | except Exception: |
| LOW | …arkitdown/src/markitdown/converters/_pptx_converter.py | 262 | except Exception: |
| MEDIUM | …arkitdown/src/markitdown/converters/_pptx_converter.py | 235 | def _convert_chart_to_markdown(self, chart): |
| LOW | …own/src/markitdown/converter_utils/docx/pre_process.py | 150 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 83 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 120 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 123 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 297 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 302 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 332 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 380 | except Exception as e: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 386 | except Exception: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 413 | except Exception as e: |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 419 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_xlsx_converter_with_ocr.py | 134 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_xlsx_converter_with_ocr.py | 208 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_xlsx_converter_with_ocr.py | 211 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 121 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 132 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 248 | except Exception: |
| MEDIUM | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 222 | def _convert_chart_to_markdown(self, chart): |
| LOW | …own-ocr/src/markitdown_ocr/_docx_converter_with_ocr.py | 152 | except Exception: |
| LOW | …own-ocr/src/markitdown_ocr/_docx_converter_with_ocr.py | 155 | except Exception: |
| LOW | …ages/markitdown-ocr/src/markitdown_ocr/_ocr_service.py | 78 | except Exception: |
| LOW | …ages/markitdown-ocr/src/markitdown_ocr/_ocr_service.py | 107 | except Exception as e: |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 115 | |
| LOW | packages/markitdown/src/markitdown/_exceptions.py | 58 | |
| LOW | packages/markitdown/src/markitdown/__main__.py | 14 | |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 141 | |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 275 | |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 489 | |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 561 | |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 696 | |
| LOW | …down/src/markitdown/converters/_doc_intel_converter.py | 71 | |
| LOW | …down/src/markitdown/converters/_doc_intel_converter.py | 104 | |
| LOW | …wn/src/markitdown/converters/_outlook_msg_converter.py | 127 | |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 120 | |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 398 | |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 520 | |
| LOW | …/markitdown/src/markitdown/converters/_cu_converter.py | 251 | |
| LOW | …itdown/src/markitdown/converters/_youtube_converter.py | 70 | |
| LOW | …itdown/src/markitdown/converters/_youtube_converter.py | 211 | |
| LOW | …down/src/markitdown/converters/_bing_serp_converter.py | 57 | |
| LOW | …rkitdown/src/markitdown/converters/_ipynb_converter.py | 57 | |
| LOW | …arkitdown/src/markitdown/converters/_epub_converter.py | 53 | |
| LOW | …arkitdown/src/markitdown/converters/_pptx_converter.py | 61 | |
| LOW | …markitdown/src/markitdown/converters/_zip_converter.py | 87 | |
| LOW | …own/src/markitdown/converter_utils/docx/pre_process.py | 118 | |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 28 | |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 158 | |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 340 | |
| LOW | …own-ocr/src/markitdown_ocr/_xlsx_converter_with_ocr.py | 149 | |
| LOW | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 54 | |
| LOW | …own-ocr/src/markitdown_ocr/_pptx_converter_with_ocr.py | 87 | |
| LOW | …own-ocr/src/markitdown_ocr/_docx_converter_with_ocr.py | 126 |
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | …ckages/markitdown/tests/test_files/test_wikipedia.html | 936 | </p><p>In 1990, Microsoft introduced the <a href="/wiki/Microsoft_Office" title="Microsoft Office">Microsoft Office</a> |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| MEDIUM | packages/markitdown/tests/test_files/test_serp.html | 17 | var logJSText=function(n,t){t===void 0&&(t=null);(new Image).src=_G.lsUrl+'&Type=Event.ClientInst&DATA=[{"T":"CI.ClientI |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 790 | # There were no other guesses, so just add the base guess |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | …-sample-plugin/src/markitdown_sample_plugin/_plugin.py | 63 | # Read the file stream into an str using hte provided charset encoding, or using the system default |
| LOW | …-sample-plugin/src/markitdown_sample_plugin/_plugin.py | 67 | # Return the result |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 56 | # Check if line contains ONLY a partial numbering (with possible whitespace/pipes) |
| LOW | packages/markitdown/tests/test_pdf_masterformat.py | 139 | # Check if next non-empty line exists and wasn't merged |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 392 | # Check if we have a seekable stream. If not, load the entire stream into memory. |
| LOW | packages/markitdown/src/markitdown/_markitdown.py | 614 | # Check if the converter will accept the file, and if so, try to convert it |
| LOW | …rkitdown/src/markitdown/converters/_audio_converter.py | 100 | # Return the result |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 37 | # Check if this line is ONLY a partial numbering |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 296 | # Check if enough rows are table rows (at least 20%) |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 328 | # Check if this row starts a table region |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 383 | # Check if we're inside a table region (not at start) |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 448 | # Assign words to columns |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 477 | # Check if cells contain short, structured data (not long text) |
| LOW | …markitdown/src/markitdown/converters/_pdf_converter.py | 539 | # Read file stream into BytesIO for compatibility with pdfplumber |
| LOW | …markitdown/src/markitdown/converters/_csv_converter.py | 44 | # Read the file content |
| LOW | …down-ocr/src/markitdown_ocr/_pdf_converter_with_ocr.py | 94 | # Check if dimensions are valid |
| LOW | …own-ocr/src/markitdown_ocr/_xlsx_converter_with_ocr.py | 165 | # Check if sheet has images |
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | packages/markitdown/tests/test_module_misc.py | 18 | # This file contains module tests that are not directly tested by the FileTestVectors. |
| MEDIUM | packages/markitdown/tests/test_cli_misc.py | 5 | # This file contains CLI tests that are not directly tested by the FileTestVectors. |
| MEDIUM | packages/markitdown/src/markitdown/_markitdown.py | 530 | # Create an initial guess from all this information |
| MEDIUM | packages/markitdown/src/markitdown/_markitdown.py | 569 | # Create a copy of the page_converters list, sorted by priority. |
| MEDIUM | …own/src/markitdown/converter_utils/docx/pre_process.py | 85 | # Create a new paragraph tag |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| HIGH | packages/markitdown/tests/test_files/test_rss.xml | 1 | <rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ns0="http://www.w3.org/2005/Atom" xmlns:ns1="http://purl.org/rss/ |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | packages/markitdown/src/markitdown/_base_converter.py | 51 | Return a quick determination on if the converter should attempt converting the document. This is primar |
| HIGH | packages/markitdown/src/markitdown/_base_converter.py | 90 | Convert a document to Markdown text. Parameters: - file_stream: The file-like object to conver |
| HIGH | …/markitdown/src/markitdown/converters/_cu_converter.py | 397 | Resolve analyzer modality from cache or via get_analyzer() fallback. For known prebuilt-* names, returns the modali |
| Severity | File | Line | Snippet |
|---|---|---|---|
| CRITICAL | …arkitdown/src/markitdown/converters/_pptx_converter.py | 133 | alt_text = shape._element._nvXxPr.cNvPr.attrib.get("descr", "") |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | packages/markitdown/tests/test_module_vectors.py | 0 | test the conversion of a stream with no stream info. |
| HIGH | packages/markitdown/tests/test_module_vectors.py | 0 | test the conversion of a stream with no stream info. |
| HIGH | packages/markitdown/tests/test_cli_vectors.py | 0 | test the conversion of a stream with no stream info. |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | .devcontainer/devcontainer.json | 21 | // Use 'forwardPorts' to make a list of ports inside the container available locally. |
| LOW | packages/markitdown-sample-plugin/tests/__init__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |
| LOW | …ample-plugin/src/markitdown_sample_plugin/__about__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |
| LOW | packages/markitdown/tests/__init__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |
| LOW | packages/markitdown/src/markitdown/__about__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |
| LOW | packages/markitdown-mcp/tests/__init__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |
| LOW | packages/markitdown-mcp/src/markitdown_mcp/__about__.py | 1 | # SPDX-FileCopyrightText: 2024-present Adam Fourney <adamfo@microsoft.com> |