Python scraper based on AI
785 matches across 12 categories. Click a row to expand file-level details.
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | tests/test_fetch_node_timeout.py | 165 | # Create a mock loader that takes longer than timeout |
| MEDIUM | tests/test_json_scraper_graph.py | 46 | # Create a JSONScraperGraph instance |
| MEDIUM | tests/test_json_scraper_graph.py | 103 | # Create a JSONScraperGraph instance with a single JSON file |
| MEDIUM | tests/test_json_scraper_graph.py | 157 | # Create a JSONScraperGraph instance |
| MEDIUM | tests/test_json_scraper_graph.py | 203 | # Define a custom schema |
| MEDIUM | tests/test_json_scraper_graph.py | 220 | # Create a JSONScraperGraph instance with a custom schema |
| MEDIUM | tests/test_script_creator_multi_graph.py | 172 | # Create a dummy graph that raises an exception when execute is called. |
| MEDIUM | tests/test_chromium.py | 1026 | # Create a loader instance with a backend that does not have a corresponding scraping method. |
| MEDIUM | tests/test_chromium.py | 1038 | # Create a dummy undetected_chromedriver module with a dummy Chrome driver that always fails. |
| MEDIUM | tests/test_chromium.py | 2007 | # Create a loader instance with a retry limit of 2 and a short timeout. |
| MEDIUM | tests/test_chromium.py | 295 | # Create a dummy undetected_chromedriver module with a dummy Chrome driver. |
| MEDIUM | tests/test_chromium.py | 623 | # Create a loader with JS support and a retry_limit of 2 (so one failure is allowed) |
| MEDIUM | tests/test_chromium.py | 1232 | # Create a dummy undetected_chromedriver module where Chrome is defined but will not be used. |
| MEDIUM | tests/test_chromium.py | 1491 | # Define a dummy scraper that returns an integer (non‐string) |
| MEDIUM | tests/test_chromium.py | 1976 | # Create a dummy playwright so that evaluate and content can be called |
| MEDIUM | tests/test_omni_search_graph.py | 7 | # Create a dummy graph class to simulate graph execution |
| MEDIUM | tests/nodes/search_internet_node_test.py | 18 | # Define the model |
| MEDIUM | tests/nodes/search_link_node_test.py | 14 | # Define the configuration for the graph |
| MEDIUM | tests/nodes/search_link_node_test.py | 22 | # Define the SearchLinkNode with necessary configurations |
| MEDIUM | tests/nodes/search_link_node_test.py | 29 | # Define the initial state for the node |
| MEDIUM | tests/utils/copy_utils_test.py | 176 | original["self"] = original # Create a circular reference |
| MEDIUM | tests/graphs/screenshot_scraper_test.py | 13 | # Define a fixture for the graph configuration |
| MEDIUM | tests/graphs/code_generator_graph_openai_test.py | 17 | # Define the output schema for the graph |
| MEDIUM | tests/graphs/xml_scraper_openai_test.py | 16 | # Define the test fixtures and helpers |
| MEDIUM | tests/graphs/xml_scraper_openai_test.py | 49 | # Define the test cases |
| MEDIUM | tests/graphs/abstract_graph_test.py | 56 | # Create a mock graph with mock nodes |
| MEDIUM | tests/graphs/abstract_graph_test.py | 61 | # Create a TestGraph instance with the mock graph |
| MEDIUM | tests/graphs/abstract_graph_test.py | 254 | # Create a mock graph with mock nodes |
| MEDIUM | tests/graphs/abstract_graph_test.py | 260 | # Create a TestGraph instance with the mock graph |
| MEDIUM | tests/graphs/search_graph_openai_test.py | 15 | # Define the test fixtures and helpers |
| MEDIUM | tests/graphs/search_graph_openai_test.py | 36 | # Define the test cases |
| MEDIUM | examples/speech_graph/speech_graph_openai.py | 23 | # Define the configuration for the graph |
| MEDIUM | examples/speech_graph/speech_graph_openai.py | 39 | # Create the SpeechGraph instance and run it |
| MEDIUM | examples/custom_graph/ollama/custom_graph_ollama.py | 16 | # Define the configuration for the graph |
| MEDIUM | examples/custom_graph/ollama/custom_graph_ollama.py | 31 | # Define the graph nodes |
| MEDIUM | examples/custom_graph/ollama/custom_graph_ollama.py | 75 | # Create the graph by defining the connections |
| MEDIUM | examples/custom_graph/openai/custom_graph_openai.py | 22 | # Define the configuration for the graph |
| MEDIUM | examples/custom_graph/openai/custom_graph_openai.py | 34 | # Define the graph nodes |
| MEDIUM | examples/custom_graph/openai/custom_graph_openai.py | 86 | # Create the graph by defining the connections |
| MEDIUM | …mples/json_scraper_graph/ollama/json_scraper_ollama.py | 26 | # Define the configuration for the graph |
| MEDIUM | …mples/json_scraper_graph/ollama/json_scraper_ollama.py | 41 | # Create the JSONScraperGraph instance and run it |
| MEDIUM | …mples/json_scraper_graph/openai/json_scraper_openai.py | 26 | # Define the configuration for the graph |
| MEDIUM | …mples/json_scraper_graph/openai/json_scraper_openai.py | 39 | # Create the JSONScraperGraph instance and run it |
| MEDIUM | …mples/json_scraper_graph/openai/omni_scraper_openai.py | 16 | # Define the configuration for the graph |
| MEDIUM | …mples/json_scraper_graph/openai/omni_scraper_openai.py | 32 | # Create the OmniScraperGraph instance and run it |
| MEDIUM | examples/json_scraper_graph/openai/md_scraper_openai.py | 26 | # Define the configuration for the graph |
| MEDIUM | examples/json_scraper_graph/openai/md_scraper_openai.py | 39 | # Create the DocumentScraperGraph instance and run it |
| MEDIUM | …_generator_graph/ollama/code_generator_graph_ollama.py | 15 | # Define the output schema for the graph |
| MEDIUM | …_generator_graph/ollama/code_generator_graph_ollama.py | 29 | # Define the configuration for the graph |
| MEDIUM | …_generator_graph/ollama/code_generator_graph_ollama.py | 54 | # Create the SmartScraperGraph instance and run it |
| MEDIUM | …_generator_graph/openai/code_generator_graph_openai.py | 16 | # Define the output schema for the graph |
| MEDIUM | …_generator_graph/openai/code_generator_graph_openai.py | 30 | # Define the configuration for the graph |
| MEDIUM | …_generator_graph/openai/code_generator_graph_openai.py | 54 | # Create the SmartScraperGraph instance and run it |
| MEDIUM | examples/search_graph/ollama/search_graph_ollama.py | 9 | # Define the configuration for the graph |
| MEDIUM | examples/search_graph/ollama/search_graph_ollama.py | 25 | # Create the SearchGraph instance and run it |
| MEDIUM | …ples/search_graph/ollama/search_graph_schema_ollama.py | 13 | # Define the output schema for the graph |
| MEDIUM | …ples/search_graph/ollama/search_graph_schema_ollama.py | 27 | # Define the configuration for the graph |
| MEDIUM | …ples/search_graph/ollama/search_graph_schema_ollama.py | 42 | # Create the SearchGraph instance and run it |
| MEDIUM | …amples/search_graph/openai/search_link_graph_openai.py | 14 | # Define the configuration for the graph |
| MEDIUM | …amples/search_graph/openai/search_link_graph_openai.py | 29 | # Create the SearchLinkGraph instance and run it |
| 96 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | …amples/search_graph/openai/search_link_graph_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …art_scraper_graph/ollama/smart_scraper_multi_ollama.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …les/smart_scraper_graph/ollama/smart_scraper_ollama.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …craper_graph/ollama/smart_scraper_multi_lite_ollama.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …aper_graph/ollama/smart_scraper_multi_concat_ollama.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …mart_scraper_graph/ollama/smart_scraper_lite_ollama.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …aper_graph/openai/smart_scraper_multi_concat_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …les/smart_scraper_graph/openai/smart_scraper_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …craper_graph/openai/smart_scraper_multi_lite_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …art_scraper_graph/openai/smart_scraper_multi_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …mart_scraper_graph/openai/smart_scraper_lite_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …ript_generator_graph/openai/script_generator_openai.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/slow_mo.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/custom_prompt.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/browser_base_integration.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/force_mode.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/proxy_rotation.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/undected_playwright.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/reasoning.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/rag_caching.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/load_yml.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | examples/extras/scrape_do.py | 0 | basic example of scraping pipeline using smartscraper |
| HIGH | …enerator_graph/ollama/script_multi_generator_ollama.py | 0 | basic example of scraping pipeline using scriptcreatorgraph |
| HIGH | …ript_generator_graph/ollama/script_generator_ollama.py | 0 | basic example of scraping pipeline using scriptcreatorgraph |
| HIGH | …enerator_graph/openai/script_generator_multi_openai.py | 0 | basic example of scraping pipeline using scriptcreatorgraph |
| HIGH | …nerator_graph/openai/script_generator_schema_openai.py | 0 | basic example of scraping pipeline using scriptcreatorgraph |
| HIGH | scrapegraphai/nodes/rag_node.py | 0 | a node responsible for compressing the input tokens and storing the document in a vector database for retrieval. relevan |
| HIGH | scrapegraphai/nodes/generate_answer_node_k_level.py | 0 | a node responsible for compressing the input tokens and storing the document in a vector database for retrieval. relevan |
| HIGH | scrapegraphai/nodes/description_node.py | 0 | a node responsible for compressing the input tokens and storing the document in a vector database for retrieval. relevan |
| HIGH | scrapegraphai/graphs/search_link_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/depth_search_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/smart_scraper_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/code_generator_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/document_scraper_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/smart_scraper_lite_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/omni_scraper_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/script_creator_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/xml_scraper_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/json_scraper_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping. returns: basegraph: a graph instance representing |
| HIGH | scrapegraphai/graphs/search_link_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/speech_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/smart_scraper_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/screenshot_scraper_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/document_scraper_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/omni_scraper_graph.py | 0 | executes the scraping process and returns the answer to the prompt. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/search_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/omni_search_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/xml_scraper_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/csv_scraper_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/smart_scraper_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/script_creator_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/json_scraper_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/document_scraper_multi_graph.py | 0 | creates the graph of nodes representing the workflow for web scraping and searching. returns: basegraph: a graph instanc |
| HIGH | scrapegraphai/graphs/search_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | …rapegraphai/graphs/smart_scraper_multi_concat_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/omni_search_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/xml_scraper_multi_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/csv_scraper_multi_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/smart_scraper_multi_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| HIGH | scrapegraphai/graphs/script_creator_multi_graph.py | 0 | executes the web scraping and searching process. returns: str: the answer to the prompt. |
| 5 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | scrapegraphai/docloaders/chromium.py | 175 | Asynchronously scrape the content of a given URL using Playwright's sync API and scrolling. Notes: |
| HIGH | scrapegraphai/docloaders/chromium.py | 324 | Asynchronously scrape the content of a given URL using Playwright's async API. Args: url ( |
| HIGH | scrapegraphai/docloaders/chromium.py | 385 | Asynchronously scrape the content of a given URL by rendering JavaScript using Playwright. Args: |
| HIGH | scrapegraphai/builders/graph_builder.py | 14 | GraphBuilder is a dynamic tool for constructing web scraping graphs based on user prompts. It utilizes a natura |
| HIGH | scrapegraphai/nodes/search_link_node.py | 94 | Filter out relevant links from the webpage that are relavant to prompt. Out of the filtered links, also |
| HIGH | scrapegraphai/nodes/search_node_with_context.py | 51 | Generates an answer by constructing a prompt from the user's input and the scraped content, querying th |
| HIGH | scrapegraphai/nodes/merge_answers_node.py | 58 | Executes the node's logic to merge the answers from multiple graph instances into a single answer. |
| HIGH | scrapegraphai/nodes/search_internet_node.py | 61 | Generates an answer by constructing a prompt from the user's input and the scraped content, querying th |
| HIGH | scrapegraphai/nodes/reasoning_node.py | 57 | Generate a refined prompt for the reasoning task based on the user's input and the JSON schema. |
| HIGH | scrapegraphai/nodes/graph_iterator_node.py | 47 | Executes the node's logic to instantiate and run multiple graph instances in parallel. Args: |
| HIGH | scrapegraphai/nodes/graph_iterator_node.py | 82 | asynchronously executes the node's logic with multiple graph instances running in parallel, using a semaphore of |
| HIGH | scrapegraphai/nodes/generate_answer_csv_node.py | 71 | Generates an answer by constructing a prompt from the user's input and the scraped content, querying th |
| HIGH | scrapegraphai/nodes/generate_scraper_node.py | 56 | Generates a python script for scraping a website using the specified library. Args: state |
| HIGH | scrapegraphai/nodes/robots_node.py | 59 | Checks if a website is scrapeable based on the robots.txt file and updates the state with the scrapeabi |
| HIGH | scrapegraphai/nodes/parse_node_depth_k_node.py | 43 | Executes the node's logic to parse the HTML documents content. Args: state (dict): The cur |
| HIGH | scrapegraphai/nodes/fetch_node_level_k.py | 73 | Executes the node's logic to fetch the HTML content of a specified URL and its sub-links recursively, t |
| HIGH | scrapegraphai/nodes/merge_generated_scripts_node.py | 42 | Executes the node's logic to merge the answers from multiple graph instances into a single answer. |
| HIGH | scrapegraphai/nodes/fetch_node.py | 224 | Handles the local source by fetching HTML content, optionally converting it to Markdown, and updating t |
| HIGH | scrapegraphai/nodes/fetch_node.py | 267 | Handles the web source by fetching HTML content from a URL, optionally converting it to Markdown, and u |
| HIGH | scrapegraphai/nodes/concat_answers_node.py | 42 | Executes the node's logic to concatenate the answers from multiple graph instances into a single answer |
| HIGH | scrapegraphai/nodes/generate_code_node.py | 94 | Generates Python code for a function that extracts data from HTML based on a output schema. Args: |
| HIGH | scrapegraphai/nodes/generate_code_node.py | 146 | Executes the overall reasoning loop to generate and validate the code. Args: state (dict): |
| HIGH | scrapegraphai/nodes/text_to_speech_node.py | 40 | Converts text to speech using the specified text-to-speech model. Args: state (dict): The |
| HIGH | scrapegraphai/nodes/markdownify_node.py | 42 | Executes the node's logic to convert HTML content to Markdown. Args: state (dict): The cur |
| HIGH | scrapegraphai/nodes/get_probable_tags_node.py | 46 | Generates a list of probable HTML tags based on the user's input and updates the state with this list. |
| HIGH | scrapegraphai/nodes/base_node.py | 13 | An abstract base class for nodes in a graph-based workflow, designed to perform specific actions when executed. |
| HIGH | scrapegraphai/nodes/base_node.py | 99 | Determines the necessary state keys based on the input specification. Args: state (dict): |
| HIGH | scrapegraphai/nodes/base_node.py | 137 | Parses the input keys expression to extract relevant keys from the state based on logical conditions. |
| HIGH | scrapegraphai/nodes/html_analyzer_node.py | 59 | Generates an analysis of the provided HTML code based on the wanted infromations to be extracted. Args |
| HIGH | scrapegraphai/nodes/parse_node.py | 63 | Executes the node's logic to parse the HTML document content and split it into chunks. Args: |
| HIGH | scrapegraphai/nodes/prompt_refiner_node.py | 63 | Generate a refined prompt using the user's prompt, the schema, and additional context. Args: |
| HIGH | scrapegraphai/nodes/generate_answer_omni_node.py | 65 | Generates an answer by constructing a prompt from the user's input and the scraped content, querying th |
| HIGH | scrapegraphai/utils/batch_api.py | 118 | Create and submit an OpenAI Batch API job. Args: client: An initialized OpenAI client. requests: Li |
| HIGH | scrapegraphai/utils/batch_api.py | 196 | Poll a batch job until it completes, fails, or times out. Args: client: An initialized OpenAI client. |
| HIGH | scrapegraphai/utils/code_error_analysis.py | 107 | Analyzes the syntax errors in the generated code. Args: state (dict): Contains the 'generated_code' an |
| HIGH | scrapegraphai/utils/code_error_analysis.py | 160 | Analyzes the execution errors in the generated code and HTML code. Args: state (dict): Contains the 'g |
| HIGH | scrapegraphai/utils/code_error_analysis.py | 215 | Analyzes the validation errors in the generated code based on a JSON schema. Args: state (dict): Conta |
| HIGH | scrapegraphai/utils/code_error_analysis.py | 278 | Analyzes the semantic differences in the generated code based on a comparison result. Args: state (dic |
| HIGH | scrapegraphai/utils/code_error_correction.py | 89 | Generates corrected code based on syntax error analysis. Args: state (dict): Contains the 'generated_c |
| HIGH | scrapegraphai/utils/code_error_correction.py | 142 | Generates corrected code based on execution error analysis. Args: state (dict): Contains the 'generate |
| HIGH | scrapegraphai/utils/code_error_correction.py | 195 | Generates corrected code based on validation error analysis. Args: state (dict): Contains the 'generat |
| HIGH | scrapegraphai/utils/code_error_correction.py | 254 | Generates corrected code based on semantic error analysis. Args: state (dict): Contains the 'generated |
| HIGH | scrapegraphai/utils/copy.py | 36 | Safely create a deep copy of an object, handling special cases. Args: obj: Object to copy Returns |
| HIGH | scrapegraphai/utils/parse_state_keys.py | 9 | Parses a complex boolean expression involving state keys. Args: expression (str): The boolean expressi |
| HIGH | scrapegraphai/utils/research_web.py | 172 | Search web function with improved error handling, validation, and security features. Args: query (str) |
| HIGH | scrapegraphai/utils/sys_dynamic_import.py | 16 | imports a python module from its srcfile Args: modpath: The srcfile absolute path modname: The |
| HIGH | …apegraphai/utils/screenshot_scraping/text_detection.py | 7 | Detects and extracts text from a given image. Parameters: image (PIL Image): The input image to ext |
| HIGH | …ai/utils/screenshot_scraping/screenshot_preparation.py | 225 | Crop an image using the specified coordinates. Parameters: image (PIL.Image): The image to be cropped. |
| HIGH | scrapegraphai/integrations/indexify_node.py | 38 | Executes the node's logic to index the content present in the state. Args: state (dict): T |
| HIGH | scrapegraphai/graphs/abstract_graph.py | 119 | Create a large language model instance based on the configuration provided. Args: llm_conf |
| HIGH | scrapegraphai/graphs/base_graph.py | 19 | BaseGraph manages the execution flow of a graph composed of interconnected nodes. Attributes: nodes (l |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | tests/test_fetch_node_timeout.py | 65 | def test_timeout_default_value(self): |
| LOW | tests/test_fetch_node_timeout.py | 74 | def test_timeout_custom_value(self): |
| LOW | tests/test_fetch_node_timeout.py | 102 | def test_requests_get_with_timeout(self, mock_requests): |
| LOW | tests/test_fetch_node_timeout.py | 125 | def test_requests_get_without_timeout_when_none(self, mock_requests): |
| LOW | tests/test_fetch_node_timeout.py | 147 | def test_pdf_parsing_with_timeout(self): |
| LOW | tests/test_fetch_node_timeout.py | 163 | def test_pdf_parsing_timeout_exceeded(self): |
| LOW | tests/test_fetch_node_timeout.py | 189 | def test_timeout_propagated_to_chromium_loader(self, mock_loader_class): |
| LOW | tests/test_fetch_node_timeout.py | 213 | def test_timeout_not_overridden_in_loader_kwargs(self, mock_loader_class): |
| LOW | tests/test_csv_scraper_multi_graph.py | 74 | def test_create_graph_structure(monkeypatch): |
| LOW | tests/test_csv_scraper_multi_graph.py | 119 | def test_run_argument_passing(): |
| LOW | tests/test_csv_scraper_multi_graph.py | 146 | def test_run_with_exception_in_execute(): |
| LOW | tests/test_cleanup_html.py | 13 | def test_extract_from_script_tags(): |
| LOW | tests/test_cleanup_html.py | 32 | def test_cleanup_html_success(): |
| LOW | tests/test_cleanup_html.py | 61 | def test_cleanup_html_no_body(): |
| LOW | tests/conftest.py | 373 | def pytest_collection_modifyitems(config, items): |
| LOW | tests/test_json_scraper_graph.py | 21 | def test_json_scraper_graph_with_directory( |
| LOW | tests/test_json_scraper_graph.py | 78 | def test_json_scraper_graph_with_single_file( |
| LOW | tests/test_json_scraper_graph.py | 135 | def test_json_scraper_graph_no_answer_found( |
| LOW | tests/test_json_scraper_graph.py | 189 | def test_json_scraper_graph_with_custom_schema( |
| LOW | tests/test_scrape_do.py | 9 | def test_scrape_do_fetch_without_proxy(): |
| LOW | tests/test_scrape_do.py | 36 | def test_scrape_do_fetch_with_proxy_no_geo(): |
| LOW | tests/test_scrape_do.py | 70 | def test_scrape_do_fetch_with_proxy_with_geo(): |
| LOW | tests/test_scrape_do.py | 107 | def test_scrape_do_fetch_without_proxy_custom_env(): |
| LOW | tests/test_scrape_do.py | 131 | def test_scrape_do_fetch_with_proxy_custom_env(): |
| LOW | tests/test_scrape_do.py | 161 | def test_scrape_do_fetch_exception_propagation(): |
| LOW | tests/test_scrape_do.py | 174 | def test_scrape_do_fetch_with_proxy_with_geo_and_super_false(): |
| LOW | tests/test_scrape_do.py | 208 | def test_scrape_do_fetch_empty_token_without_proxy(): |
| LOW | tests/test_scrape_do.py | 230 | def test_scrape_do_fetch_with_proxy_with_empty_geo(): |
| LOW | tests/test_scrape_do.py | 265 | def test_scrape_do_fetch_api_encoding_special_characters(): |
| LOW | tests/test_script_creator_multi_graph.py | 65 | def test_create_graph_structure(self): |
| LOW | tests/test_script_creator_multi_graph.py | 130 | def test_create_graph_node_configs(self): |
| LOW | tests/test_script_creator_multi_graph.py | 181 | def test_run_with_empty_prompt(self): |
| LOW | tests/test_batch_api.py | 45 | def test_to_jsonl_line_with_max_tokens(self): |
| LOW | tests/test_batch_api.py | 56 | def test_to_jsonl_line_with_response_format(self): |
| LOW | tests/test_batch_api.py | 67 | def test_to_jsonl_line_without_optional_fields(self): |
| LOW | tests/test_batch_api.py | 78 | def test_to_jsonl_line_custom_temperature(self): |
| LOW | tests/test_batch_api.py | 159 | def test_retrieve_no_output_file(self): |
| LOW | tests/test_batch_api.py | 173 | def test_results_sorted_by_custom_id(self): |
| LOW | tests/test_batch_api.py | 230 | def test_handles_partial_failures(self): |
| LOW | tests/test_batch_api.py | 280 | def test_rejects_non_openai_provider(self): |
| LOW | tests/test_batch_api.py | 293 | def test_rejects_groq_provider(self): |
| LOW | tests/test_batch_api.py | 313 | def test_empty_parsed_docs_raises(self): |
| LOW | tests/test_batch_api.py | 349 | def test_model_name_extraction(self): |
| LOW | tests/test_batch_api.py | 366 | def test_batch_model_override(self): |
| LOW | tests/test_batch_api.py | 386 | def test_format_instructions_without_schema(self): |
| LOW | tests/test_depth_search_graph.py | 19 | def test_depth_search_graph_initialization(self, source, expected_input_key): |
| LOW | tests/test_models_tokens.py | 41 | def test_non_existent_provider(self): |
| LOW | tests/test_models_tokens.py | 52 | def test_specific_token_value(self): |
| LOW | tests/test_models_tokens.py | 58 | def test_non_empty_model_keys(self): |
| LOW | tests/test_models_tokens.py | 88 | def test_specific_model_token_values(self): |
| LOW | tests/test_models_tokens.py | 121 | def test_no_whitespace_in_model_names(self): |
| LOW | tests/test_models_tokens.py | 130 | def test_specific_models_additional(self): |
| LOW | tests/test_generate_answer_node.py | 45 | def test_process_missing_content_and_user_prompt(dummy_node): |
| LOW | tests/test_generate_answer_node.py | 84 | def test_execute_multiple_chunks(dummy_node_with_pipe): |
| LOW | tests/test_generate_answer_node.py | 109 | def test_execute_single_chunk(dummy_node_with_pipe): |
| LOW | tests/test_generate_answer_node.py | 125 | def test_execute_merge_json_decode_error(dummy_node_with_pipe): |
| LOW | tests/test_generate_answer_node.py | 190 | def test_execute_timeout_single_chunk(dummy_node_with_pipe): |
| LOW | tests/test_generate_answer_node.py | 206 | def test_execute_script_creator_single_chunk(): |
| LOW | tests/test_generate_answer_node.py | 249 | def test_init_chat_ollama_format(): |
| LOW | tests/test_chromium.py | 486 | async def test_alazy_load_empty_urls(): |
| 161 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | tests/test_minimax_models.py | 5 | |
| LOW | tests/utils/test_sys_dynamic_import.py | 71 | |
| LOW | tests/utils/test_sys_dynamic_import.py | 77 | |
| LOW | scrapegraphai/docloaders/__init__.py | 5 | |
| LOW | scrapegraphai/docloaders/__init__.py | 6 | |
| LOW | scrapegraphai/docloaders/__init__.py | 7 | |
| LOW | scrapegraphai/docloaders/__init__.py | 8 | |
| LOW | scrapegraphai/docloaders/plasmate.py | 37 | |
| LOW | scrapegraphai/builders/__init__.py | 5 | |
| LOW | scrapegraphai/nodes/__init__.py | 5 | |
| LOW | scrapegraphai/nodes/__init__.py | 6 | |
| LOW | scrapegraphai/nodes/__init__.py | 7 | |
| LOW | scrapegraphai/nodes/__init__.py | 8 | |
| LOW | scrapegraphai/nodes/__init__.py | 9 | |
| LOW | scrapegraphai/nodes/__init__.py | 10 | |
| LOW | scrapegraphai/nodes/__init__.py | 11 | |
| LOW | scrapegraphai/nodes/__init__.py | 12 | |
| LOW | scrapegraphai/nodes/__init__.py | 13 | |
| LOW | scrapegraphai/nodes/__init__.py | 14 | |
| LOW | scrapegraphai/nodes/__init__.py | 15 | |
| LOW | scrapegraphai/nodes/__init__.py | 16 | |
| LOW | scrapegraphai/nodes/__init__.py | 17 | |
| LOW | scrapegraphai/nodes/__init__.py | 18 | |
| LOW | scrapegraphai/nodes/__init__.py | 19 | |
| LOW | scrapegraphai/nodes/__init__.py | 20 | |
| LOW | scrapegraphai/nodes/__init__.py | 21 | |
| LOW | scrapegraphai/nodes/__init__.py | 22 | |
| LOW | scrapegraphai/nodes/__init__.py | 23 | |
| LOW | scrapegraphai/nodes/__init__.py | 24 | |
| LOW | scrapegraphai/nodes/__init__.py | 25 | |
| LOW | scrapegraphai/nodes/__init__.py | 26 | |
| LOW | scrapegraphai/nodes/__init__.py | 27 | |
| LOW | scrapegraphai/nodes/__init__.py | 28 | |
| LOW | scrapegraphai/nodes/__init__.py | 29 | |
| LOW | scrapegraphai/nodes/__init__.py | 30 | |
| LOW | scrapegraphai/nodes/__init__.py | 31 | |
| LOW | scrapegraphai/nodes/__init__.py | 32 | |
| LOW | scrapegraphai/nodes/__init__.py | 33 | |
| LOW | scrapegraphai/nodes/__init__.py | 34 | |
| LOW | scrapegraphai/nodes/__init__.py | 35 | |
| LOW | scrapegraphai/nodes/__init__.py | 36 | |
| LOW | scrapegraphai/nodes/batch_generate_answer_node.py | 10 | |
| LOW | scrapegraphai/nodes/batch_generate_answer_node.py | 10 | |
| LOW | scrapegraphai/nodes/batch_generate_answer_node.py | 13 | |
| LOW | scrapegraphai/nodes/batch_generate_answer_node.py | 19 | |
| LOW | scrapegraphai/utils/batch_api.py | 15 | |
| LOW | scrapegraphai/utils/batch_api.py | 16 | |
| LOW | scrapegraphai/utils/__init__.py | 5 | |
| LOW | scrapegraphai/utils/__init__.py | 6 | |
| LOW | scrapegraphai/utils/__init__.py | 6 | |
| LOW | scrapegraphai/utils/__init__.py | 7 | |
| LOW | scrapegraphai/utils/__init__.py | 7 | |
| LOW | scrapegraphai/utils/__init__.py | 7 | |
| LOW | scrapegraphai/utils/__init__.py | 7 | |
| LOW | scrapegraphai/utils/__init__.py | 13 | |
| LOW | scrapegraphai/utils/__init__.py | 13 | |
| LOW | scrapegraphai/utils/__init__.py | 13 | |
| LOW | scrapegraphai/utils/__init__.py | 13 | |
| LOW | scrapegraphai/utils/__init__.py | 19 | |
| LOW | scrapegraphai/utils/__init__.py | 20 | |
| 130 more matches not shown… | |||
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | tests/conftest.py | 28 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 30 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 149 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 151 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 173 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 175 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 254 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 256 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 291 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 293 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 317 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 319 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 349 | # ============================================================================ |
| MEDIUM | tests/conftest.py | 351 | # ============================================================================ |
| MEDIUM | tests/test_plasmate.py | 97 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 99 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 139 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 141 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 179 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 181 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 221 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 223 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 257 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 259 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 13 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 15 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 32 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 34 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 65 | # --------------------------------------------------------------------------- |
| MEDIUM | tests/test_plasmate.py | 67 | # --------------------------------------------------------------------------- |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | tests/integration/test_smart_scraper_integration.py | 124 | except Exception as e: |
| LOW | examples/extras/chromium_selenium.py | 45 | except Exception as e: |
| LOW | examples/extras/chromium_selenium.py | 81 | except Exception as e: |
| LOW | examples/extras/chromium_selenium.py | 115 | except Exception as e: |
| LOW | examples/extras/chromium_selenium.py | 138 | except Exception as e: |
| LOW | examples/extras/chromium_selenium.py | 147 | except Exception as e: |
| LOW | scrapegraphai/docloaders/chromium.py | 82 | except Exception as e: |
| LOW | scrapegraphai/nodes/search_link_node.py | 146 | except Exception as e: |
| LOW | scrapegraphai/nodes/generate_answer_node.py | 86 | except Exception as e: |
| LOW | scrapegraphai/nodes/generate_answer_node.py | 114 | except Exception as e: |
| LOW | scrapegraphai/nodes/fetch_node_level_k.py | 229 | except Exception as e: |
| LOW | scrapegraphai/nodes/fetch_node_level_k.py | 251 | except Exception as e: |
| LOW | scrapegraphai/nodes/image_to_text_node.py | 83 | except Exception: |
| LOW | scrapegraphai/nodes/generate_code_node.py | 467 | except Exception as e: |
| LOW | scrapegraphai/nodes/conditional_node.py | 109 | except Exception as e: |
| LOW | scrapegraphai/nodes/parse_node.py | 106 | except Exception: |
| LOW | scrapegraphai/utils/code_error_analysis.py | 155 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_analysis.py | 210 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_analysis.py | 271 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_analysis.py | 334 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_correction.py | 135 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_correction.py | 188 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_correction.py | 247 | except Exception as e: |
| LOW | scrapegraphai/utils/code_error_correction.py | 318 | except Exception as e: |
| LOW | scrapegraphai/utils/copy.py | 70 | except Exception as e: |
| LOW | scrapegraphai/utils/cleanup_html.py | 41 | except Exception: |
| LOW | scrapegraphai/utils/research_web.py | 293 | except Exception as e: |
| LOW | scrapegraphai/utils/research_web.py | 334 | except Exception as e: |
| LOW | scrapegraphai/utils/research_web.py | 380 | except Exception as e: |
| LOW | scrapegraphai/graphs/abstract_graph.py | 278 | except Exception as e: |
| LOW | scrapegraphai/graphs/base_graph.py | 195 | except Exception: |
| LOW | scrapegraphai/graphs/base_graph.py | 293 | except Exception as e: |
| LOW | scrapegraphai/telemetry/telemetry.py | 23 | except Exception: |
| LOW | scrapegraphai/telemetry/telemetry.py | 34 | except Exception: |
| LOW | scrapegraphai/telemetry/telemetry.py | 44 | except Exception: |
| LOW | scrapegraphai/telemetry/telemetry.py | 52 | except Exception: |
| LOW | scrapegraphai/telemetry/telemetry.py | 141 | except Exception as e: |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | scrapegraphai/docloaders/chromium.py | 87 | |
| LOW | scrapegraphai/docloaders/chromium.py | 166 | |
| LOW | scrapegraphai/docloaders/chromium.py | 323 | |
| LOW | scrapegraphai/docloaders/chromium.py | 382 | |
| LOW | scrapegraphai/nodes/fetch_node_level_k.py | 234 | |
| LOW | scrapegraphai/nodes/fetch_node.py | 172 | |
| LOW | scrapegraphai/nodes/fetch_node.py | 266 | |
| LOW | scrapegraphai/utils/llm_callback_manager.py | 37 | |
| LOW | scrapegraphai/utils/cleanup_html.py | 13 | |
| LOW | scrapegraphai/utils/dict_content_compare.py | 15 | |
| LOW | scrapegraphai/utils/research_web.py | 161 | |
| LOW | scrapegraphai/utils/research_web.py | 250 | |
| LOW | scrapegraphai/utils/research_web.py | 338 | |
| LOW | scrapegraphai/utils/schema_trasform.py | 6 | |
| LOW | scrapegraphai/utils/schema_trasform.py | 17 | |
| LOW | …ai/utils/screenshot_scraping/screenshot_preparation.py | 44 | |
| LOW | …ai/utils/screenshot_scraping/screenshot_preparation.py | 67 | |
| LOW | scrapegraphai/graphs/abstract_graph.py | 118 | |
| LOW | scrapegraphai/graphs/base_graph.py | 130 | |
| LOW | scrapegraphai/graphs/base_graph.py | 157 | |
| LOW | scrapegraphai/graphs/base_graph.py | 236 | |
| LOW | scrapegraphai/graphs/base_graph.py | 344 |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | docs/timeout_configuration.md | 169 | "api_key": "your-api-key" |
| HIGH | examples/markdownify/readme.md | 22 | sgai_client = Client(api_key="your-api-key") |
| HIGH | examples/smart_scraper_graph/scrapegraphai/readme.md | 48 | SCRAPEGRAPH_API_KEY=your-api-key-here |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | tests/conftest.py | 390 | # Check if any API key is available |
| LOW | tests/test_json_scraper_graph.py | 246 | # Check if the custom schema was passed to GenerateAnswerNode |
| LOW | tests/test_chromium.py | 1073 | # Set evaluate to always return constant height value (simulate constant page height) |
| LOW | tests/nodes/robot_node_test.py | 72 | # Set force_scraping to True |
| LOW | tests/nodes/search_link_node_test.py | 56 | # Check if the result is not None |
| LOW | scrapegraphai/utils/code_error_analysis.py | 134 | # Check if syntax errors exist |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | tests/test_chromium.py | 535 | # If no failure, simply pass. |
| LOW | scrapegraphai/nodes/generate_scraper_node.py | 109 | # If there are more than one chunks returned by ParseNode we just use the first one |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | .github/workflows/codeql.yml | 1 | # For most projects, this workflow file will not need changing; you simply need |
| LOW | .github/workflows/dependency-review.yml | 1 | # Dependency Review Action |