Repository Analysis

pathwaycom/pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

14.2 Low AI signal
Adjusted Score: 14.2
Raw Score: 14.2
Time Factor: 100%
Last Push: 2026-05-30
Stars: 63,188
Language: Python
Lines of Code: 343,413
Files: 1320
Pattern Hits: 3619
Scan Date: 2026-05-31

Severity Breakdown

CRITICAL 6 · HIGH 312 · MEDIUM 204 · LOW 3097

Pattern Findings

3619 matches across 17 categories.
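
As a quick consistency check, the four severity counts reported above can be summed and compared with the reported total of 3619 pattern hits. The sketch below is illustrative only: the variable names are hypothetical and are not part of the scanner's output, and the numbers are copied from this report.

```python
# Severity counts copied from the Severity Breakdown in this report.
severity_counts = {"CRITICAL": 6, "HIGH": 312, "MEDIUM": 204, "LOW": 3097}

# "Pattern Hits" value copied from the summary block.
reported_total = 3619

# Sum the per-severity counts and compute each level's share of all hits.
total = sum(severity_counts.values())
shares = {level: count / total for level, count in severity_counts.items()}

print("counts match reported total:", total == reported_total)
for level, share in shares.items():
    print(f"{level}: {share:.1%}")
```

The shares make it easier to see how much of the total is driven by LOW-severity hits.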

Cross-File Repetition: 253 hits · 1265 pts
Severity  File  Line  Snippet
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_api.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age | size 11 | 3 | alice | 10 | m 12 | 1 | bob | 9 | l 13 | 1 | tom | 8 | xl
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age | size 11 | 3 | alice | 10 | m 12 | 1 | bob | 9 | l 13 | 1 | tom | 8 | xl
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age | size 11 | 3 | alice | 10 | m 12 | 1 | bob | 9 | l 13 | 1 | tom | 8 | xl
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age | size 11 | 3 | alice | 10 | m 12 | 1 | bob | 9 | l 13 | 1 | tom | 8 | xl
HIGH  python/pathway/tests/test_api.py  0  | pet | owner | age | size 11 | 3 | alice | 10 | m 12 | 1 | bob | 9 | l 13 | 1 | tom | 8 | xl
HIGH  python/pathway/tests/test_common.py  0  | genus | epithet 1 | upupa | epops 2 | acherontia | atropos 3 | bubo | scandiacus 4 | dynastes | hercules
HIGH  python/pathway/tests/test_common.py  0  | genus | epithet 1 | upupa | epops 2 | acherontia | atropos 3 | bubo | scandiacus 4 | dynastes | hercules
HIGH  python/pathway/tests/test_common.py  0  | genus | epithet 1 | upupa | epops 2 | acherontia | atropos 3 | bubo | scandiacus 4 | dynastes | hercules
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_reducers.py  0  pet | owner | age dog | alice | 10 dog | bob | 9 cat | alice | 8 dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age 1 | alice | 10 1 | bob | 9 2 | alice | 8 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age 1 | alice | 10 1 | bob | 9 2 | alice | 8 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age 1 | alice | 10 1 | bob | 9 2 | alice | 8 1 | bob | 7
HIGH  python/pathway/tests/test_error_messages.py  0  pet | owner | age 1 | alice | 10 1 | bob | 9 2 | alice | 8 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9 3 | 2 | alice | 8 4 | 1 | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | dog | alice | 10 2 | dog | bob | 9 3 | cat | alice | 8 4 | dog | bob | 7
HIGH  python/pathway/tests/test_build_and_run.py  0  | pet | owner | age 1 | dog | alice | 10 2 | dog | bob | 9 3 | cat | alice | 8 4 | dog | bob | 7
HIGH  python/pathway/tests/test_build_and_run.py  0  | pet | owner | age 1 | dog | alice | 10 2 | dog | bob | 9 3 | cat | alice | 8 4 | dog | bob | 7
HIGH  python/pathway/tests/test_build_and_run.py  0  | pet | owner | age 1 | dog | alice | 10 2 | dog | bob | 9 3 | cat | alice | 8 4 | dog | bob | 7
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9
HIGH  python/pathway/tests/test_common.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9
HIGH  python/pathway/tests/test_error_messages.py  0  | pet | owner | age 1 | 1 | alice | 10 2 | 1 | bob | 9
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | bob | 10 cat | alice | 9 cat | alice | 8 dog | bob | 7 foo | charlie | 6
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | bob | 10 cat | alice | 9 cat | alice | 8 dog | bob | 7 foo | charlie | 6
HIGH  python/pathway/tests/test_common.py  0  pet | owner | age dog | bob | 10 cat | alice | 9 cat | alice | 8 dog | bob | 7 foo | charlie | 6
HIGH  python/pathway/tests/test_graphs.py  0  | u | v 100 | 1 | 2 101 | 1 | 3 102 | 1 | 2 103 | 2 | 2 104 | 2 | 3 105 | 5 | 5 106 | 5 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v 100 | 1 | 2 101 | 1 | 3 102 | 1 | 2 103 | 2 | 2 104 | 2 | 3 105 | 5 | 5 106 | 5 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v 100 | 1 | 2 101 | 1 | 3 102 | 1 | 2 103 | 2 | 2 104 | 2 | 3 105 | 5 | 5 106 | 5 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 5.0 2 | 2 | 1 | 5.0 3 | 3 | 4 | 5.0 4 | 4 | 3 | 5.0 5 | 1 | 4 | 15.0 6 | 4 | 1 | 15.0 7 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 5.0 2 | 2 | 1 | 5.0 3 | 3 | 4 | 5.0 4 | 4 | 3 | 5.0 5 | 1 | 4 | 15.0 6 | 4 | 1 | 15.0 7 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 5.0 2 | 2 | 1 | 5.0 3 | 3 | 4 | 5.0 4 | 4 | 3 | 5.0 5 | 1 | 4 | 15.0 6 | 4 | 1 | 15.0 7 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 5.0 2 | 2 | 1 | 5.0 3 | 3 | 4 | 5.0 4 | 4 | 3 | 5.0 5 | 1 | 4 | 15.0 6 | 4 | 1 | 15.0 7 | 5
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 50.0 2 | 2 | 1 | 50.0 3 | 3 | 4 | 50.0 4 | 4 | 3 | 50.0 5 | 1 | 4 | 0.5 6 | 4 | 1 | 0.5 7 |
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 50.0 2 | 2 | 1 | 50.0 3 | 3 | 4 | 50.0 4 | 4 | 3 | 50.0 5 | 1 | 4 | 0.5 6 | 4 | 1 | 0.5 7 |
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 50.0 2 | 2 | 1 | 50.0 3 | 3 | 4 | 50.0 4 | 4 | 3 | 50.0 5 | 1 | 4 | 0.5 6 | 4 | 1 | 0.5 7 |
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 50.0 2 | 2 | 1 | 50.0 3 | 3 | 4 | 50.0 4 | 4 | 3 | 50.0 5 | 1 | 4 | 0.5 6 | 4 | 1 | 0.5 7 |
HIGH  python/pathway/tests/test_graphs.py  0  | u | v | weight 1 | 1 | 2 | 50.0 2 | 2 | 1 | 50.0 3 | 3 | 4 | 50.0 4 | 4 | 3 | 50.0 5 | 1 | 4 | 0.5 6 | 4 | 1 | 0.5 7 |
193 more matches not shown…

Hyper-Verbose Identifiers: 1427 hits · 1103 pts
Severity  File  Line  Snippet
LOW  python/pathway/conftest.py  91  def tmp_path_with_airbyte_config(tmp_path):
LOW  python/pathway/cli.py  52  def validate_and_resolve_spawn_args(
LOW  python/pathway/cli.py  181  def terminate_process_handles(process_handles: list[subprocess.Popen]):
LOW  python/pathway/udfs.py  23  ... async def long_running_async_function(a: int, b: int) -> int:
LOW  python/pathway/io/_synchronization.py  57  def register_input_synchronization_group(
LOW  python/pathway/io/_utils.py  220  def construct_raw_data_schema_by_flags(
LOW  python/pathway/io/_utils.py  237  def construct_schema_and_data_format(
LOW  python/pathway/io/_utils.py  362  def check_raw_and_plaintext_only_kwargs_for_message_queues(f):
LOW  python/pathway/io/_utils.py  571  def add_column_reference_to_extract(
LOW  python/pathway/io/_utils.py  612  def maybe_schema_registry_settings(
LOW  python/pathway/io/python/__init__.py  331  def _create_python_datasource(
LOW  python/pathway/io/deltalake/__init__.py  61  def _ensure_versions_are_consecutive(self):
LOW  python/pathway/io/deltalake/__init__.py  81  def _create_no_op_version_in_between(self, prev_version: int, current_version: int):
LOW  python/pathway/io/deltalake/__init__.py  259  def _get_actual_parquet_block_paths(self) -> set[str]:
LOW  python/pathway/io/deltalake/__init__.py  270  def _remove_obsolete_versions(self, tlog_file_names: list[str]):
LOW  python/pathway/io/deltalake/__init__.py  490  def _read_table_schema_from_metadata(
LOW  python/pathway/io/bigquery/__init__.py  42  def _convert_to_compatible_format(self, row: dict[str, Any]):
LOW  python/pathway/io/gdrive/__init__.py  135  def _deduce_list_objects_strategy(self) -> _ListObjectsStrategy:
LOW  python/pathway/io/gdrive/__init__.py  168  def _traverse_objects_with_limit(self) -> list[GDriveFile]:
LOW  python/pathway/io/gdrive/__init__.py  209  def _detect_objects_with_full_scan(self) -> list[GDriveFile]:
LOW  python/pathway/io/gdrive/__init__.py  326  def _prepare_download_request(self, file: GDriveFile) -> Any:
LOW  python/pathway/io/postgres/__init__.py  131  def _augment_postgres_settings(settings: dict, unique_name: str | None) -> dict:
LOW  python/pathway/io/postgres/__init__.py  145  def _connection_string_from_settings(settings: dict):
LOW  python/pathway/io/postgres/__init__.py  154  def _replication_connection_string_from_settings(settings: dict):
LOW  python/pathway/io/postgres/__init__.py  242  def _construct_replication_settings(
LOW  python/pathway/io/http/_common.py  50  def wait_duration_before_retry(self):
LOW  python/pathway/io/http/_server.py  217  def _add_optional_traits_if_present(self, field_description, props):
LOW  python/pathway/io/http/_server.py  223  def _construct_openapi_plaintext_schema(self, schema) -> dict:
LOW  python/pathway/io/http/_server.py  242  def _construct_openapi_get_request_schema(self, schema) -> list:
LOW  python/pathway/io/http/_server.py  265  def _construct_openapi_json_schema(self, schema) -> dict:
LOW  python/pathway/io/http/_server.py  445  def _create_request_processor(self):
LOW  python/pathway/io/http/_server.py  572  def _wrap_handler_with_logger(
LOW  python/pathway/io/airbyte/logic.py  89  def _handle_stream_state_inner(self, stream):
LOW  python/pathway/tests/test_async_transformer.py  304  def test_with_instance_work_after_restart():
LOW  python/pathway/tests/test_async_transformer.py  348  def test_fails_whole_instance():
LOW  python/pathway/tests/test_async_transformer.py  391  def test_fails_on_too_many_columns():
LOW  python/pathway/tests/test_async_transformer.py  414  def test_fails_on_not_enough_columns():
LOW  python/pathway/tests/test_async_transformer.py  470  def test_consistent_when_instance_for_key_changes():
LOW  python/pathway/tests/test_async_transformer.py  503  def test_requires_hashable_instance():
LOW  python/pathway/tests/test_async_transformer.py  568  def test_commits_even_if_blocked_on_processing(tmp_path):
LOW  python/pathway/tests/test_async_transformer.py  597  def test_commits_even_if_nothing_to_process(tmp_path):
LOW  python/pathway/tests/test_deduplicate.py  14  def test_deduplicate_keeps_state(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_deduplicate.py  103  def test_deduplicate_keeps_state_after_two_restarts(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_deduplicate.py  225  def test_deduplicate_with_instance_keeps_state(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_deduplicate.py  303  def test_deduplicate_keeps_state_after_code_change(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_deduplicate.py  369  def test_deduplicate_keeps_state_with_regular_persistence(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_deduplicate.py  414  def test_selective_persistence_name_set(
LOW  python/pathway/tests/test_deduplicate.py  485  def test_selective_persistence_no_name_set_or_different_names_set(
LOW  python/pathway/tests/test_deduplicate.py  559  def test_deduplicate_python_tuple():
LOW  python/pathway/tests/test_io_leann.py  51  def test_on_change_deletion_nonexistent_key(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  61  def test_on_change_skips_empty_text(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  86  def test_on_change_with_metadata(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  104  def test_on_time_end_rebuilds_when_dirty(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  115  def test_on_end_builds_index_when_dirty(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  125  def test_on_end_skips_build_when_not_dirty_and_index_exists(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  135  def test_build_index_skips_when_no_documents(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  144  def test_build_index_with_documents(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  182  def test_observer_initialization_via_write_params(tmp_path: pathlib.Path):
LOW  python/pathway/tests/test_io_leann.py  203  def test_check_leann_available_raises_on_missing_package():
LOW  python/pathway/tests/test_io_leann.py  209  def test_write_raises_on_non_string_text_column():
1367 more matches not shown…

Over-Commented Block: 845 hits · 830 pts
Severity  File  Line  Snippet
LOW  python/pathway/io/gdrive/__init__.py  41  "application/vnd.google-apps.presentation": "application/vnd.openxmlformats-officedocument.presentationml.presentati
LOW  python/pathway/io/gdrive/__init__.py  61  # costly.
LOW  python/pathway/io/postgres/__init__.py  61  return prefix
LOW  python/pathway/io/postgres/__init__.py  81  # Detection-time math: with ``keepalives_idle=300`` and
LOW  python/pathway/io/postgres/__init__.py  161  def enc(v) -> str:
LOW  python/pathway/io/postgres/__init__.py  801  raise ValueError(
LOW  python/pathway/tests/test_graphs.py  221
LOW  python/pathway/tests/test_persistence.py  141  wait_result_with_checker(
LOW  python/pathway/tests/test_deltalake.py  1461  time_start_5 = int(time.time() * 1000)
LOW  python/pathway/tests/test_persistence_iterate.py  281
LOW  python/pathway/tests/test_persistence_iterate.py  301  # 20 | False | b | 10 ← inherited from prev (a)
LOW  python/pathway/tests/test_persistence_iterate.py  381  return expected
LOW  python/pathway/tests/test_persistence_iterate.py  401  ],
LOW  python/pathway/tests/test_persistence_iterate.py  421
LOW  python/pathway/tests/test_persistence_iterate.py  441  #
LOW  python/pathway/tests/test_persistence_iterate.py  461  # chunk=10 ────── chunk=30 ──────
LOW  python/pathway/tests/test_persistence_iterate.py  481  # Run 2: event c gains a flag
LOW  python/pathway/tests/test_persistence_iterate.py  501  #
LOW  python/pathway/tests/test_persistence_iterate.py  521  # Run 3: + 45(F,x) → second split inside chunk=30
LOW  python/pathway/tests/test_persistence_iterate.py  541  # chunk=10={a,b,c,d} chunk=50={e} chunk=55={x,f}
LOW  python/pathway/tests/test_persistence_iterate.py  561  #
LOW  integration_tests/kafka/utils.py  421  except Exception:
LOW  integration_tests/db_connectors/test_milvus_types.py  1  # Copyright © 2026 Pathway
LOW  integration_tests/db_connectors/test_mongodb.py  781  pw.io.jsonlines.write(table, output_path)
LOW  integration_tests/db_connectors/test_mongodb.py  1241  # common in real MongoDB collections. The reader maps them to the closest
LOW  integration_tests/db_connectors/test_mssql.py  1381  # 1205 deadlock victim error as a test failure even though the test
LOW  integration_tests/db_connectors/test_leann.py  141
LOW  integration_tests/db_connectors/test_leann.py  161  # )
LOW  integration_tests/db_connectors/test_postgres.py  4641  "pg_interval,expected_td",
LOW  …/4.user-guide/40.temporal-data/.window-join/article.py  1  # ---
LOW  …/4.user-guide/40.temporal-data/.window-join/article.py  21  # src: '/assets/content/tutorials/clickstream_window_join/window_join_illustrative.png'
LOW  …/4.user-guide/40.temporal-data/.window-join/article.py  101
LOW  …poral-data/.interval_join_temporal_behavior/article.py  1  # ---
LOW  …poral-data/.interval_join_temporal_behavior/article.py  81
LOW  …poral-data/.interval_join_temporal_behavior/article.py  121  # However, the maximal seen time in the `orders` stream when Frank's order arrived was $128$
LOW  …poral-data/.interval_join_temporal_behavior/article.py  221  discounts_with_updates.price,
LOW  …s/4.user-guide/40.temporal-data/.asof-join/.article.py  1  # ---
LOW  …s/4.user-guide/40.temporal-data/.asof-join/.article.py  21  # * `Forward`: It finds the next row in the other table where the time is greater than or equal to the time in the curre
LOW  …s/4.user-guide/40.temporal-data/.asof-join/.article.py  41  #
LOW  …s/4.user-guide/40.temporal-data/.asof-join/.article.py  161  preview_table(merged, time_col="event_t")
LOW  …40.temporal-data/.windows_temporal_behavior/article.py  1  # ---
LOW  …40.temporal-data/.windows_temporal_behavior/article.py  21  #
LOW  …de/40.temporal-data/.windowby-reduce-manual/article.py  1  # ---
LOW  ….user-guide/40.temporal-data/.interval-join/article.py  1  # ---
LOW  ….user-guide/40.temporal-data/.interval-join/article.py  101  # Pathway has a function called `interval_join` (resp. `interval_join_{outer/left/right}`) which you can use to perform
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  1  # ---
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  21  # name: python3
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  41  # ## Location of files and filter.
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  81  # ! cat plain_output/out1.txt
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  101
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  141  # ! cat ./csv_output/csv_out1.txt
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  161  header1: str
LOW  …uide/20.connect/99.connectors/.fs-connector/article.py  201  new_line = ','.join(parts) + '\n'
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  1  # ---
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  21  # ```
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  41  # ```
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  61
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  81  # 3,"Julian S.","Dick F.",1707985402732,1
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  101  table = pw.io.jsonlines.read("./input_file.jsonl", schema=InputSchema, mode="static")
LOW  …r-guide/20.connect/99.connectors/.jsonlines/article.py  121  #
785 more matches not shown…

Unused Imports: 611 hits · 520 pts
Severity  File  Line  Snippet
LOW  python/pathway/conftest.py  3
LOW  python/pathway/reducers.py  23
LOW  python/pathway/reducers.py  23
LOW  python/pathway/reducers.py  23
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/reducers.py  28
LOW  python/pathway/universes.py  34
LOW  python/pathway/universes.py  34
LOW  python/pathway/universes.py  34
LOW  python/pathway/__init__.py  3
LOW  python/pathway/__init__.py  5
LOW  python/pathway/__init__.py  6
LOW  python/pathway/__init__.py  8
LOW  python/pathway/__init__.py  9
LOW  python/pathway/__init__.py  10
LOW  python/pathway/__init__.py  10
LOW  python/pathway/__init__.py  10
LOW  python/pathway/__init__.py  10
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
LOW  python/pathway/__init__.py  11
551 more matches not shown…

Decorative Section Separators: 82 hits · 289 pts
Severity  File  Line  Snippet
MEDIUM  python/pathway/tests/test_persistence_iterate.py  386  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  390  # chunk=10 ────────────── chunk=40 ──────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  393  # chunk=70 ─────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  396  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  403  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  407  # chunk=10 ────────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  411  # chunk=10 ────── chunk=25 ─────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  415  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  422  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  426  # chunk=10 ─────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  430  # chunk=10 ──────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  433  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  439  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  443  # chunk=10 ─────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  447  # chunk=5 chunk=10 ─────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  451  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  457  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  461  # chunk=10 ────── chunk=30 ──────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  465  # chunk=10 ────────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  468  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  475  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  479  # chunk=10 ────────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  483  # chunk=10 ────── chunk=30 ──────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  486  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  493  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  497  # chunk=10 ────────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  504  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  512  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  516  # chunk=10 ────────────────────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  523  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  531  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  535  # chunk=10 ────── chunk=30 ────── chunk=50 ──────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  542  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  550  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  554  # chunk=10 ──────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  564  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  572  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  576  # chunk=10 ──────────────
MEDIUM  python/pathway/tests/test_persistence_iterate.py  586  # -----------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  289  # ---------------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  291  # ---------------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  317  # ---------------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  1073  # ---------------------------------------------------------------------------
MEDIUM  python/pathway/tests/test_persistence_iterate.py  1077  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/iceberg/test_iceberg.py  1293  # ----------------------------------------------------------------------------
MEDIUM  integration_tests/iceberg/test_iceberg.py  1297  # ----------------------------------------------------------------------------
MEDIUM  integration_tests/iceberg/test_iceberg.py  578  # ----------------------------------------------------------------------------
MEDIUM  integration_tests/iceberg/test_iceberg.py  586  # ----------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2276  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2283  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2329  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2337  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2382  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2389  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2436  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2446  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2492  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2502  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2536  # ---------------------------------------------------------------------------
MEDIUM  integration_tests/kafka/test_simple.py  2544  # ---------------------------------------------------------------------------
22 more matches not shown…

Slop Phrases: 55 hits · 152 pts
Severity  File  Line  Snippet
LOW  python/pathway/io/iceberg/__init__.py  183  Don't forget to run your program with ``pw.run`` once you define all necessary
LOW  python/pathway/io/iceberg/__init__.py  309  Don't forget to run your program with ``pw.run`` once you define all necessary
MEDIUM  python/pathway/io/kafka/__init__.py  129  Consider a Kafka queue running locally on port 9092. For demonstration purposes, our
MEDIUM  python/pathway/io/kafka/__init__.py  618  Consider a Kafka queue running locally on port 9092. For demonstration purposes, our
LOW  python/pathway/io/pubsub/__init__.py  122  At last, don't forget to add ``pw.run()`` to run your pipeline.
MEDIUM  …/4.user-guide/40.temporal-data/.window-join/article.py  124  # Here's how you can do it:
MEDIUM  …poral-data/.interval_join_temporal_behavior/article.py  22  # To deliver streaming data in a production system, you can use a connector like Kafka or Redpanda (you can read more ab
MEDIUM  …poral-data/.interval_join_temporal_behavior/article.py  88  # As such, their orders are not present in the `result` table. If you want to include all orders (also those that can't
MEDIUM  …40.temporal-data/.windows_temporal_behavior/article.py  16  # The examples in this article use the sliding windows, but you can also use behaviors with tumbling windows.
MEDIUM  …40.temporal-data/.windows_temporal_behavior/article.py  58  # To that end, you can use sliding windows. To keep things simple, start with a piece of code that only groups data into
MEDIUM  ….user-guide/40.temporal-data/.interval-join/article.py  101  # Pathway has a function called `interval_join` (resp. `interval_join_{outer/left/right}`) which you can use to perform
MEDIUM  ….user-guide/40.temporal-data/.interval-join/article.py  101  # Pathway has a function called `interval_join` (resp. `interval_join_{outer/left/right}`) which you can use to perform
MEDIUM  …uide/20.connect/99.connectors/.fs-connector/article.py  121  # In most cases, in order to read the data, you need to define its schema and pass it to the connector. Furthermore, for
MEDIUM  …velopers/4.user-guide/20.connect/.datatypes/article.py  80  # Sometimes you may want to compute a value of a column, using e.g. a function from an external library, that does not d
MEDIUM  …velopers/4.user-guide/20.connect/.datatypes/article.py  156  # Similarly, as in the conversion between `str` and `bytes`, you can use `apply_with_type` to convert a column of type `
MEDIUM  …velopers/4.user-guide/20.connect/.datatypes/article.py  187  # As in the case of conversion from `int` to `float`, you can use `pw.cast` to convert data from type `float` to `int`.
MEDIUM  …/4.user-guide/20.connect/.schema_generation/article.py  80  # If you have a CSV file with sample data, you can use it to generate a schema in Pathway. The name of columns will be t
MEDIUM  …/4.user-guide/20.connect/.schema_generation/article.py  101  # For the example, you can use the default values of the optional arguments, so the Python code is:
MEDIUM  …/4.user-guide/20.connect/.schema_generation/article.py  116  # To do that, you can use a method in the Schema class called `generate_class` to generate a string with a class definit
MEDIUM  …10.introduction/.streaming-and-static-modes/article.py  33  # In practice, you would need two input connectors to create T1 and T2, you can use the previous sample of code to build
MEDIUM  …pers/4.user-guide/70.migrating/.from_pandas/article.py  115  # The arguments to `select` are dynamic; you can use them to define the desired column names and then pass the content.
MEDIUM  docs/2.developers/4.user-guide/.sql/article.py  76  # With the `SELECT ...` and `WHERE ...` clauses, you can use the following operators:
MEDIUM  docs/2.developers/4.user-guide/.sql/article.py  133  # With `GROUP BY`, you can use the following aggregation functions:
MEDIUM  ….user-guide/30.data-transformation/.iterate/article.py  315  # Now you can use the `bellman_ford` in iterate. Note the [`pw.iterate_universe`](/developers/api-docs/pathway#pathway.i
MEDIUM  ….user-guide/30.data-transformation/.iterate/article.py  324  # If you wish, you can try simulating changes in the graph over time by adding `__time__` and `__diff__` columns to `pw.
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  109  # Monitoring changes of answers to your queries might not be what you want. Especially if you have **a lot of** queries.
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  157  # In contrast to an ordinary [`join`](/developers/user-guide/data-transformation/join-manual), `asof_now_join` is not sy
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  177  # If you don't want answers to your queries to be updated, you can use [`get_nearest_items_asof_now`](/developers/api-do
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  247  # To receive the queries, you can use the [`rest_connector`](/developers/api-docs/pathway-io/http#pathway.io.http.rest_c
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  274  # To answer the queries, you can use the `get_nearest_items_asof_now` method of the index. It'll find the nearest points
MEDIUM  …/30.data-transformation/.indexes-in-pathway/article.py  370  # To intercept messages in a stream, you can use the [`pw.io.subscribe`](/developers/api-docs/pathway-io#pathway.io.subs
MEDIUM  …e/30.data-transformation/.async-transformer/article.py  14  # However, if the flexibility of the UDFs is still not enough, you can use even more general and flexible `AsyncTransfor
MEDIUM…e/30.data-transformation/.async-transformer/article.py203# If you want to guarantee an order within a group of records, you can use the `instance` parameter of the `AsyncTransfo
MEDIUM…data-transformation/.user-defined-functions/article.py35# Now you can use it as an ordinary Pathway expression, as in the example shown below.
MEDIUM…data-transformation/.user-defined-functions/article.py77# For example, you can't add a column with type `Any` to a column of type `int` (you also can't add `Any` to `Any`), but
MEDIUM…data-transformation/.user-defined-functions/article.py107# Also note that the `inc` function is only present in this guide for demonstration purposes.
MEDIUM…data-transformation/.user-defined-functions/article.py411# Note that accidentally you created a sleepsort. Values in the `finishing` messages are sorted! As an exercise, you can
MEDIUM…r-guide/30.data-transformation/.join-manual/article.py125# that here, you can use columns of both tables as arguments, e.g.
MEDIUM…r-guide/30.data-transformation/.join-manual/article.py479# To handle joins using those columns, you can use [`pointer_from`](/developers/api-docs/pathway-table#pathway.Table.poi
MEDIUM…r-guide/30.data-transformation/.join-manual/article.py499# `join_left` in `people_allergies`, you can use the following:
MEDIUM…2.developers/7.templates/ETL/.option-greeks/article.py28# Pathway comes with an easy-to-use Python API, syntax that is simple and intuitive, and you can use the same code for b
MEDIUM…2.developers/7.templates/ETL/.option-greeks/article.py373# To compute the average prices for all bids, you can use a simple [groupby/reduce](https://pathway.com/developers/user-
MEDIUM…2.developers/7.templates/ETL/.option-greeks/article.py686# Furthermore, you can use _[Databento live APIs](https://databento.com/docs/api-reference-live?historical=python&live=p
LOW…2.developers/7.templates/ETL/.option-greeks/article.py127# To continue, make sure to install all the needed packages.
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py261# The value 1224744825 is hand-picked to fit this dataset; you can replace it by any desired value.
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py363# To compute the distance, you can use the `pyproj` package, which you can install by uncommenting and executing the fol
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py386# Note this is the simple 2-dimensional distance, but you can use your favorite distance metric.
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py398# Then, you can use `ix` method to fetch values from corresponding `prev` and `next` rows, and use them to compute the i
MEDIUM…lopers/7.templates/ETL/.fuzzy_join_chapter1/article.py204# If you would like to get some more experience with Pathway, you can try those two challenges:
MEDIUM…velopers/7.templates/ETL/.live_data_jupyter/article.py43# Because Pathway [unites static and stream processing](/developers/user-guide/connect/switch-from-batch-to-streaming#sw
MEDIUM…velopers/7.templates/ETL/.live_data_jupyter/article.py154# The code is analogous to the 20-minute statistics but simpler: you can use a [`tumbling window`](/glossary/tumbling-wi
MEDIUM…emplates/ETL/.interval_over_gaussian_filter/article.py211# To do so, you can use `windowby`, that takes three arguments:
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py210# default file you can use to test
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py299# For implementing this example using open source LLMs, here’s a [private RAG app template](https://pathway.com/develope
MEDIUM…7.templates/rag/.private_rag_ollama_mistral/article.py53# To tackle this, you can use a **private RAG: locally deployed LLMs and embedders in your RAG pipeline**.
Docstring Block Structure · 27 hits · 135 pts
Severity · File · Line · Snippet
HIGHpython/pathway/io/_subscribe.py26 Calls a callback function ``on_change`` on every change happening in table. Args: table: the table to
HIGHpython/pathway/io/http/_server.py101 Adds an example to the collection. Args: id: Short and unique ID for the example. It is us
HIGHpython/pathway/io/kafka/__init__.py556Write a table to a given topic on a Kafka instance. The produced messages consist of the key, corresponding to row'
HIGHpython/pathway/io/pubsub/__init__.py62Publish the ``table``'s stream of changes into the specified PubSub topic. Please note that ``table`` must consist o
HIGHpython/pathway/internals/table_subscription.py189 Calls a callback function on_change on every change happening in table. This method is similar to the one we ex
HIGHpython/pathway/internals/expressions/date_time.py385Returns a number of (nano,micro,milli)seconds from 1970-01-01 for naive DateTime and from 1970-01-01 UTC for tim
HIGHpython/pathway/internals/expressions/date_time.py663Converts DateTimeNaive to UTC from time zone provided as `from_timezone` argument. If the given DateTime doesn't
HIGHpython/pathway/internals/expressions/date_time.py753Converts DateTimeUtc to time zone specified as `timezone` argument. Args: timezone: The time zone t
HIGHpython/pathway/internals/expressions/date_time.py978Rounds DateTime to precision specified by `duration` argument. Args: duration: rounding precision
HIGHpython/pathway/internals/expressions/date_time.py1050Truncates DateTime to precision specified by `duration` argument. Args: duration: truncation precis
HIGHpython/pathway/stdlib/temporal/_interval_join.py65Allows testing whether two times are within a certain distance. Note: Usually used as an argument of `.inte
HIGHpython/pathway/stdlib/temporal/_interval_join.py589Performs an interval join of self with other using a time difference and join expressions. If `self_time + lower_bou
HIGHpython/pathway/stdlib/temporal/_interval_join.py793Performs an interval join of self with other using a time difference and join expressions. If `self_time + lower_bou
HIGHpython/pathway/stdlib/temporal/_interval_join.py992Performs an interval left join of self with other using a time difference and join expressions. If `self_time + lowe
HIGHpython/pathway/stdlib/temporal/_interval_join.py1209Performs an interval right join of self with other using a time difference and join expressions. If `self_time + low
HIGHpython/pathway/stdlib/temporal/_interval_join.py1415Performs an interval outer join of self with other using a time difference and join expressions. If `self_time + low
HIGHpython/pathway/stdlib/temporal/_window.py598Allows grouping together elements within a window across ordered time-like data column by locally grouping adjacent
HIGHpython/pathway/stdlib/temporal/_window.py663Allows grouping together elements within a window of a given length sliding across ordered time-like data column acc
HIGHpython/pathway/stdlib/temporal/_window.py737Allows grouping together elements within a window of a given length tumbling across ordered time-like data column st
HIGHpython/pathway/stdlib/temporal/_window.py798Allows grouping together elements within a window. Windows are created for each time t in at, by taking values with
HIGHpython/pathway/stdlib/temporal/_window_join.py167Performs a window join of self with other using a window and join expressions. If two records belong to the same win
HIGHpython/pathway/stdlib/temporal/_window_join.py370Performs a window join of self with other using a window and join expressions. If two records belong to the same win
HIGHpython/pathway/stdlib/temporal/_window_join.py571Performs a window left join of self with other using a window and join expressions. If two records belong to the sam
HIGHpython/pathway/stdlib/temporal/_window_join.py791Performs a window right join of self with other using a window and join expressions. If two records belong to the sa
HIGHpython/pathway/stdlib/temporal/_window_join.py1006Performs a window outer join of self with other using a window and join expressions. If two records belong to the sa
HIGHpython/pathway/stdlib/statistical/_interpolate.py62 Interpolates missing values in a column using the previous and next values based on a timestamps column. Args:
HIGHpython/pathway/stdlib/ordered/diff.py16 Compute the difference between the values in the ``values`` columns and the previous values according to the or
Excessive Try-Catch Wrapping · 112 hits · 134 pts
Severity · File · Line · Snippet
LOWpython/pathway/io/deltalake/__init__.py74 except Exception:
LOWpython/pathway/io/deltalake/__init__.py221 except Exception:
LOWpython/pathway/io/deltalake/__init__.py293 except Exception:
LOWpython/pathway/io/http/_server.py483 except Exception:
LOWpython/pathway/io/http/_server.py586 except Exception:
LOWpython/pathway/io/http/_server.py696 except Exception as e:
LOWpython/pathway/io/pubsub/__init__.py48 except Exception:
LOWpython/pathway/io/airbyte/logic.py184 except Exception:
MEDIUMpython/pathway/tests/utils.py412def __call__(self):
MEDIUMpython/pathway/tests/utils.py481def checker():
MEDIUMpython/pathway/tests/utils.py508def __call__(self):
MEDIUMpython/pathway/tests/utils.py532def __call__(self):
LOWpython/pathway/tests/utils.py113 except Exception as e:
LOWpython/pathway/tests/utils.py415 except Exception:
LOWpython/pathway/tests/utils.py489 except Exception:
LOWpython/pathway/tests/utils.py519 except Exception as exception:
LOWpython/pathway/tests/utils.py535 except Exception as exception:
LOW…/expressions/test_non_existing_date_time_conversion.py303 except Exception: # FIXME: make conversions work in all cases
LOWpython/pathway/xpacks/connectors/sharepoint/__init__.py201 except Exception as e:
LOWpython/pathway/xpacks/llm/parsers.py978 except Exception:
LOWpython/pathway/xpacks/llm/parsers.py1269 except Exception as e:
LOWpython/pathway/xpacks/llm/parsers.py1299 except Exception as e:
LOWpython/pathway/xpacks/llm/embedders.py59 except Exception:
LOWpython/pathway/xpacks/llm/embedders.py65 except Exception:
MEDIUMpython/pathway/xpacks/llm/embedders.py27def _safe_aclose(self):
LOWpython/pathway/xpacks/llm/prompts.py319 except Exception:
LOWpython/pathway/xpacks/llm/tests/test_parsers.py23 except Exception:
LOWpython/pathway/internals/trace.py133 except Exception as e:
LOWpython/pathway/internals/trace.py143 except Exception as e:
MEDIUMpython/pathway/internals/trace.py130def _pathway_trace_marker(*args, **kwargs):
LOWpython/pathway/internals/udfs/retries.py94 except Exception:
LOWpython/pathway/stdlib/utils/async_transformer.py166 except Exception:
LOWintegration_tests/wordcount/base.py381 except Exception:
LOWintegration_tests/wordcount/base.py390 except Exception:
LOWintegration_tests/rag_evals/experiment.py34 except Exception:
LOWintegration_tests/rag_evals/evaluator.py303 except Exception:
MEDIUMintegration_tests/iceberg/test_iceberg.py108def __call__(self) -> bool:
MEDIUMintegration_tests/iceberg/test_iceberg.py115def provide_information_on_failure(self) -> str:
LOWintegration_tests/iceberg/test_iceberg.py112 except Exception:
LOWintegration_tests/iceberg/test_iceberg.py125 except Exception as e:
LOWintegration_tests/webserver/test_rest_connector.py332 except Exception:
LOWintegration_tests/webserver/test_rest_connector.py559 except Exception:
MEDIUMintegration_tests/webserver/test_rest_connector.py317def checker() -> bool:
MEDIUMintegration_tests/webserver/test_rest_connector.py552def checker() -> bool:
LOWintegration_tests/webserver/test_llm_xpack.py75 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py289 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py343 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py417 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py461 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py513 except Exception:
LOWintegration_tests/webserver/test_llm_xpack.py551 except Exception:
MEDIUMintegration_tests/webserver/test_llm_xpack.py283def checker() -> bool:
MEDIUMintegration_tests/webserver/test_llm_xpack.py333def checker() -> bool:
MEDIUMintegration_tests/webserver/test_llm_xpack.py395def checker() -> bool:
LOWintegration_tests/kafka/test_nats.py72 except Exception:
LOWintegration_tests/kafka/test_nats.py82 except Exception:
MEDIUMintegration_tests/kafka/test_nats.py69def _ensure_stream(self):
MEDIUMintegration_tests/kafka/test_nats.py79def _cleanup(self):
LOWintegration_tests/kafka/test_backfilling.py36 except Exception:
MEDIUMintegration_tests/kafka/test_backfilling.py33def __call__(self):
52 more matches not shown…
Cross-Language Confusion · 31 hits · 130 pts
Severity · File · Line · Snippet
HIGHpython/pathway/io/__init__.py28 null,
HIGHpython/pathway/tests/test_errors.py1282{"a": 7, "b": 1, "c": null}
HIGHpython/pathway/tests/test_io.py582 {"k": "b", "b": 2, "c": null }
HIGHpython/pathway/tests/test_io.py259 assert result.equals(expected)
HIGHpython/pathway/tests/test_io.py437 assert result.equals(expected)
HIGHpython/pathway/tests/test_io.py2952 "_airbyte_data": '{"type": "STREAM", "stream": {"stream_descriptor": {"name": "commits", "namespace": nu
HIGHpython/pathway/tests/test_io.py2970 "_airbyte_data": '{"type": "STREAM", "stream": {"stream_descriptor": {"name": "not_commits", "namespace"
HIGHpython/pathway/tests/test_io.py2992 "_airbyte_data": '{"type": "STREAM", "stream": {"stream_descriptor": {"name": "commits", "namespace": nu
HIGHpython/pathway/tests/test_io.py3400 assert result.equals(expected)
HIGHpython/pathway/tests/test_io.py3412 assert result.equals(expected)
HIGHpython/pathway/tests/test_deltalake.py139 assert final.equals(original)
HIGHpython/pathway/tests/test_deltalake.py659 '{"append_only": false, "description": null, "dtype": {"type": "INT"}, '
HIGHpython/pathway/tests/utils.py491 return expected.equals(result)
HIGHpython/pathway/tests/test_json.py674 ('"{""a"": 1,""b"": ""foo"", ""c"": null, ""d"": [1,2,3]}"', dict),
HIGHpython/pathway/tests/test_json.py691 ('{"field": 1, "b": "foo", "c": null, "d": [1,2,3]}', dict),
HIGHpython/pathway/internals/expression.py333 - returns `json(null)` for Json
HIGHpython/pathway/stdlib/utils/pandas_transformer.py70 if output_universe_arg_index is not None and not result.index.equals(
HIGHintegration_tests/s3/test_s3_deltalake.py180 assert final.equals(original)
HIGHintegration_tests/s3/test_s3_deltalake.py222 assert final.equals(original)
HIGHintegration_tests/s3/test_s3_generic.py102 assert result.equals(expected)
HIGHintegration_tests/s3/test_s3_generic.py218 assert result.equals(expected)
HIGHintegration_tests/s3/test_s3_generic.py275 assert result.equals(expected)
HIGHintegration_tests/s3/test_s3_generic.py308 assert result.equals(expected)
HIGHintegration_tests/iceberg/test_iceberg.py1374 # `important` is required (non-null) — Pathway must supply it.
HIGHintegration_tests/iceberg/test_iceberg.py1388 ), f"error should mention that the column is required / non-null, got: {msg}"
HIGHintegration_tests/iceberg/test_iceberg.py1361 """Existing iceberg table has a required (non-null) column that the user's
HIGHintegration_tests/kafka/test_rabbitmq.py140 # AMQP 1.0 property fields exist (may be null)
HIGHintegration_tests/db_connectors/test_mssql_parsing.py471 "quote'semicolon;null,./;'l][p!@#!@#!@#%$#$#]",
HIGHintegration_tests/db_connectors/test_mongodb_parsing.py241 "quote'semicolon;null,./;'l][p!@#!@#!@#%$#$#]",
HIGH…tegration_tests/db_connectors/test_postgres_parsing.py260 "quote'semicolon;null,./;'l][p!@#!@#!@#%$#$#]",
HIGHintegration_tests/db_connectors/test_postgres.py1080 null,
AI Slop Vocabulary · 36 hits · 114 pts
Severity · File · Line · Snippet
MEDIUM…/4.user-guide/40.temporal-data/.window-join/article.py122# The next step is to calculate the conversion rate, i.e., the proportion of product page visits that resulted in a purc
MEDIUM…/4.user-guide/40.temporal-data/.window-join/article.py170# And that concludes the demonstration of the comprehensive, data-driven approach to analyzing user behavior and convers
MEDIUM…s/4.user-guide/40.temporal-data/.asof-join/.article.py36# 1. **Temporal Alignment**: Realtime data is time-stamped and is often required to be analyzed in conjunction with oth
MEDIUM…s/4.user-guide/40.temporal-data/.asof-join/.article.py51# In the scope of this tutorial, you will be leveraging data from a public API in a static mode purely for illustrative
MEDIUM…s/4.user-guide/40.temporal-data/.asof-join/.article.py125# During the development and testing stages of your data processing pipeline, it's essential to validate that the data i
MEDIUM…de/40.temporal-data/.windowby-reduce-manual/article.py10# Pathway offers powerful features for time series data manipulation. One such feature is the `windowby` function, which
MEDIUM…de/40.temporal-data/.windowby-reduce-manual/article.py135# In this guide, you've mastered the use of the windowby-reduce scheme in the Pathway library, a robust tool for time-se
MEDIUM…de/40.temporal-data/.windowby-reduce-manual/article.py135# In this guide, you've mastered the use of the windowby-reduce scheme in the Pathway library, a robust tool for time-se
MEDIUM…de/40.temporal-data/.windowby-reduce-manual/article.py135# In this guide, you've mastered the use of the windowby-reduce scheme in the Pathway library, a robust tool for time-se
MEDIUM….user-guide/40.temporal-data/.interval-join/article.py12# This article offers a comprehensive tutorial on how to use the interval join function in Pathway to accurately compute
MEDIUM…velopers/4.user-guide/20.connect/.datatypes/article.py398# Below are a few examples demonstrating the `diff` operator. Essentially, it sorts the table with respect to one column
MEDIUM…velopers/4.user-guide/20.connect/.json_type/article.py144# You can utilize the [`flatten()`](/developers/api-docs/pathway-table#pathway.internals.table.Table.flatten) operator s
MEDIUM…uide/80.advanced/.declarative_vs_imperative/article.py14# Many real-world data processing tasks — such as those in logistics, supply chain management, or event stream analysis—
MEDIUM…uide/80.advanced/.declarative_vs_imperative/article.py16# In data processing, imperative pipelines require you to specify step-by-step instructions for how data should be trans
MEDIUM…opers/4.user-guide/80.advanced/.consistency/article.py117# Given that money was transferred from one account to another without any external funds entering the system, the total
MEDIUM…r-guide/30.data-transformation/.join-manual/article.py508# Essentially, for a join `left.join(right, *on)`, `pw.left` allows us to address the `left` table
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py97# Combining time series is the process of combining different time series into a single time series based on a common ti
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py106# - Environmental monitoring 🌡️: Combining time series data from air quality sensors, weather stations, and pollutant l
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py108# - Analyzing stock market data 📈: Combining time series of stock prices, trading volumes, and financial indicators can
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py460# By combining these time series, you could create a more comprehensive picture of the conditions affecting your area's
MEDIUM…s/7.templates/ETL/.interval_over_upsampling/article.py324# Standard digital signal processing systems assume robust sampling and would deal with out-of-order and irregularly sam
MEDIUM…emplates/ETL/.interval_over_gaussian_filter/article.py120# I will not delve into details: if you are interested you can check out the [Wikipedia article](https://en.wikipedia.or
MEDIUM…emplates/ETL/.interval_over_gaussian_filter/article.py496# Now that you understand the differences between idealized signal processing and data stream processing, I'm sure you c
MEDIUM…/2.developers/7.templates/rag/.adaptive-rag/article.py78# The LLM can respond using a paraphrase of the ground-truth answer. To properly score paraphrases we canonicalize answe
MEDIUM…/2.developers/7.templates/rag/.adaptive-rag/article.py78# The LLM can respond using a paraphrase of the ground-truth answer. To properly score paraphrases we canonicalize answe
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py62# Currently, most [RAG applications](/blog/retrieval-augmented-generation-beginners-guide-rag-apps) are mostly limited t
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py80# Multimodal search and Multimodal RAG (MM-RAG) both utilize deep learning models to create an embedding space—a map of
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py111# Building a multimodal RAG system for production requires a robust and scalable architecture that can handle diverse da
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py128# - **Pathway**: Provides real-time synchronization, secure document handling, and a robust in-memory vector store for i
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py142# 2. **Answering Questions**: Questions are sent to the LLM with the relevant context, including parsed tables. This all
MEDIUM….developers/7.templates/rag/.multimodal-rag/article.py182# - **[llm.parsers.DoclingParser](/developers/api-docs/pathway-xpacks-llm/parsers)**: The `DoclingParser` class efficien
LOWexamples/projects/option-greeks/greeks-replay.py212# We now simply add the volatility column, using the `select` operation on a Table.
MEDIUMexternal/timely-dataflow/timely/src/worker.rs28/// The `ProgressMode::Demand` variant is the most robust, and least
MEDIUM…l/timely-dataflow/timely/src/dataflow/operators/map.rs71 // TODO : This would be more robust if it captured an iterator and then pulled an appropriate
MEDIUMsrc/connectors/synchronization.rs9/// implementation but holding a unique source ID and metadata. Essentially, an accessor
MEDIUMsrc/connectors/aws/kinesis.rs178 // an equal chance to utilize this extra time.
Deep Nesting · 97 hits · 88 pts
Severity · File · Line · Snippet
LOWpython/pathway/cli.py145
LOWpython/pathway/cli.py206
LOWpython/pathway/io/_utils.py237
LOWpython/pathway/io/_utils.py389
LOWpython/pathway/io/deltalake/__init__.py179
LOWpython/pathway/io/deltalake/__init__.py189
LOWpython/pathway/io/gdrive/__init__.py472
LOWpython/pathway/io/postgres/__init__.py605
LOWpython/pathway/io/http/_server.py669
LOWpython/pathway/io/pyfilesystem/__init__.py39
LOWpython/pathway/io/airbyte/logic.py58
LOWpython/pathway/io/airbyte/logic.py177
LOWpython/pathway/tests/test_persistence.py212
LOWpython/pathway/tests/test_io.py80
LOWpython/pathway/tests/test_io.py3507
LOWpython/pathway/tests/test_io.py3987
LOWpython/pathway/tests/test_io.py83
LOWpython/pathway/tests/test_io.py4000
LOWpython/pathway/tests/test_io.py3528
LOWpython/pathway/tests/utils.py688
LOWpython/pathway/tests/test_unpack_snapshots.py58
LOWpython/pathway/tests/test_persistence_iterate.py601
LOWpython/pathway/tests/temporal/test_windows_stream.py85
LOWpython/pathway/tests/temporal/test_window_joins.py25
LOWpython/pathway/tests/temporal/test_window_joins.py811
LOWpython/pathway/xpacks/llm/parsers.py181
LOWpython/pathway/xpacks/llm/parsers.py523
LOWpython/pathway/xpacks/llm/parsers.py608
LOWpython/pathway/xpacks/llm/llms.py66
LOWpython/pathway/xpacks/llm/llms.py824
LOWpython/pathway/xpacks/llm/llms.py867
LOWpython/pathway/xpacks/llm/_parser_utils.py312
LOWpython/pathway/internals/type_interpreter.py158
LOWpython/pathway/internals/type_interpreter.py502
LOWpython/pathway/internals/dtype.py643
LOWpython/pathway/internals/dtype.py841
LOWpython/pathway/internals/dtype.py881
LOWpython/pathway/internals/dtype.py90
LOWpython/pathway/internals/monitoring.py211
LOWpython/pathway/internals/joins.py621
LOWpython/pathway/internals/joins.py669
LOWpython/pathway/internals/api.py132
LOWpython/pathway/internals/api.py197
LOWpython/pathway/internals/arg_handlers.py72
LOWpython/pathway/internals/arg_handlers.py73
LOWpython/pathway/internals/custom_reducers.py282
LOWpython/pathway/internals/custom_reducers.py324
LOWpython/pathway/internals/custom_reducers.py326
LOWpython/pathway/internals/table.py1192
LOWpython/pathway/internals/expression_printer.py37
LOWpython/pathway/internals/fingerprints.py20
LOWpython/pathway/internals/column.py1060
LOWpython/pathway/internals/schema.py55
LOWpython/pathway/internals/graph_runner/storage_graph.py137
LOWpython/pathway/internals/graph_runner/__init__.py134
LOWpython/pathway/internals/graph_runner/__init__.py199
LOWpython/pathway/internals/graph_runner/telemetry.py163
LOW…hon/pathway/internals/graph_runner/operator_handler.py104
LOW…rnals/graph_runner/row_transformer_operator_handler.py67
LOW…rnals/graph_runner/row_transformer_operator_handler.py90
37 more matches not shown…
Hallucination Indicators · 6 hits · 60 pts
Severity · File · Line · Snippet
CRITICAL…pers/4.user-guide/60.deployment/25.azure-aci-deploy.md293from azure.core.exceptions import HttpResponseError
CRITICAL…evelopers/7.templates/60.deploy/25.azure-aci-deploy.md293from azure.core.exceptions import HttpResponseError
CRITICALexamples/projects/azure-aci-deploy/launch.py6from azure.core.exceptions import HttpResponseError
CRITICAL…differential-dataflow/src/trace/implementations/ord.rs315 let starting_updates = self.result.vals.vals.vals.len();
CRITICAL…differential-dataflow/src/trace/implementations/ord.rs323 effort = (self.result.vals.vals.vals.len() - starting_updates) as isize;
CRITICAL…differential-dataflow/src/trace/implementations/ord.rs348 effort = (self.result.vals.vals.vals.len() - starting_updates) as isize;
Self-Referential Comments · 10 hits · 30 pts
Severity · File · Line · Snippet
MEDIUMpython/pathway/tests/utils.py208# This class is an abstract subclass of OnChangeCallback, which takes a list of entries
MEDIUM…tegration_tests/monitoring/test_detailed_monitoring.py51 # Create an empty metrics db file so dashboard can start
MEDIUMintegration_tests/db_connectors/test_mssql.py1278 # Create a table inside the new DB via its own connection.
MEDIUM…s/4.user-guide/40.temporal-data/.asof-join/.article.py74# Define the list of ticker symbols for the 5 major US companies
MEDIUM….user-guide/40.temporal-data/.interval-join/article.py52# Create a maintenance data table
MEDIUM…lopers/7.templates/ETL/.merging-time-series/article.py120# ## Creating the dataset
MEDIUM…velopers/7.templates/ETL/.live_data_jupyter/article.py84# Create a streaming data source that replays data in a CSV file. This is an easy way to simulate a live data stream wit
MEDIUM…7.templates/rag/.private_rag_ollama_mistral/article.py147# Create a table with example questions
MEDIUMexamples/projects/conf42/main.py39# Define the model
MEDIUMexamples/templates/el-pipeline/app.yaml28# Define the Kafka data source using the schema and the configurations.
Verbosity Indicators · 17 hits · 28 pts
Severity · File · Line · Snippet
LOW…al/timely-dataflow/timely/src/progress/reachability.rs570 // Step 0: If logging is enabled, construct and log inbound changes.
LOW…al/timely-dataflow/timely/src/progress/reachability.rs594 // Step 1: Drain `self.input_changes` and determine actual frontier changes.
LOW…al/timely-dataflow/timely/src/progress/reachability.rs638 // Step 2: Circulate implications of changes to `self.pointstamps`.
LOWexternal/differential-dataflow/src/capture.rs349 // Step 1: The MESSAGES operator.
LOWexternal/differential-dataflow/src/capture.rs427 // Step 2: The UPDATES operator.
LOWexternal/differential-dataflow/src/capture.rs473 // Step 3: The PROGRESS operator.
LOWexternal/differential-dataflow/src/capture.rs556 // Step 4: The FEEDBACK operator.
LOWsrc/connectors/mssql.rs1131 /// Step 1: Record the current CDC max LSN (retry until the capture agent
LOWsrc/connectors/mssql.rs1157 /// Step 2: Read the full table snapshot via a plain SELECT.
LOWsrc/connectors/mssql.rs1171 /// Step 3: Record the CDC max LSN again, immediately after the snapshot SELECT.
LOWsrc/connectors/mssql.rs1280 /// Step 4: Fetch CDC changes that raced with the snapshot read.
LOWsrc/connectors/mssql.rs1433 // Step 1: record LSN before snapshot read.
LOWsrc/connectors/mssql.rs1442 // Step 2: read the full table snapshot.
LOWsrc/connectors/mssql.rs1445 // Step 3: record LSN after snapshot read.
LOWsrc/connectors/mssql.rs1452 // Step 4: fetch CDC changes that raced with the snapshot read.
LOWsrc/connectors/mssql.rs1469 // Step 5: pick the offset that gets stamped on the snapshot
LOWsrc/connectors/mssql.rs1529 // Step 6: advance last_lsn so poll_cdc_changes starts strictly after
Redundant / Tautological Comments · 5 hits · 8 pts
Severity · File · Line · Snippet
LOWpython/pathway/internals/schema.py611 # Check if self has all columns of other
LOWpython/pathway/internals/schema.py616 # Check if types of columns are the same
LOW…de/40.temporal-data/.windowby-reduce-manual/article.py82# Print the result
LOW…de/40.temporal-data/.windowby-reduce-manual/article.py104# Print the result
LOW…de/40.temporal-data/.windowby-reduce-manual/article.py129# Print the result
Magic Placeholder Names · 1 hit · 5 pts
Severity · File · Line · Snippet
HIGH…e/20.connect/99.connectors/90.websockets-connectors.md128API_KEY = "your-api-key"
Fake / Example Data · 4 hits · 3 pts
Severity · File · Line · Snippet
LOWpython/pathway/tests/ml/test_index.py538 Lorem ipsum dolor sit amet, consectetur adipiscing elit. | 1 | 2
LOWpython/pathway/tests/ml/test_index.py538 Lorem ipsum dolor sit amet, consectetur adipiscing elit. | 1 | 2
LOWpython/pathway/xpacks/llm/tests/test_parsers.py84 "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod"
LOWpython/pathway/xpacks/llm/tests/test_parsers.py84 "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod"