A multilingual large voice generation model with full-stack support for inference, training, and deployment.
259 matches across 11 categories.
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | cosyvoice/transformer/subsampling.py | 0 | input x. args: x (torch.tensor): input tensor (#batch, time, idim). x_mask (torch.tensor): input mask (#batch, 1, time). |
| HIGH | cosyvoice/transformer/subsampling.py | 0 | input x. args: x (torch.tensor): input tensor (#batch, time, idim). x_mask (torch.tensor): input mask (#batch, 1, time). |
| HIGH | cosyvoice/transformer/subsampling.py | 0 | input x. args: x (torch.tensor): input tensor (#batch, time, idim). x_mask (torch.tensor): input mask (#batch, 1, time). |
| HIGH | runtime/triton_trtllm/token2wav_dit.py | 0 | example usage cuda_visible_devices=0 \ python3 token2wav.py --enable-trt \|\| exit 1 |
| HIGH | runtime/triton_trtllm/token2wav.py | 0 | example usage cuda_visible_devices=0 \ python3 token2wav.py --enable-trt \|\| exit 1 |
| HIGH | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 0 | example usage cuda_visible_devices=0 \ python3 token2wav.py --enable-trt \|\| exit 1 |
| HIGH | runtime/triton_trtllm/offline_inference.py | 0 | extract speech ids from token strings like <\|s_23456\|> |
| HIGH | runtime/triton_trtllm/infer_cosyvoice3.py | 0 | extract speech ids from token strings like <\|s_23456\|> |
| HIGH | examples/grpo/cosyvoice2/infer_dataset.py | 0 | extract speech ids from token strings like <\|s_23456\|> |
| HIGH | runtime/triton_trtllm/model_repo/cosyvoice2/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | runtime/triton_trtllm/model_repo/token2wav/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …me/triton_trtllm/model_repo/audio_tokenizer/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …/triton_trtllm/model_repo/speaker_embedding/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …rtllm/model_repo_cosyvoice3/audio_tokenizer/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …llm/model_repo_cosyvoice3/speaker_embedding/1/model.py | 0 | initialize the model. args: args: dictionary containing model configuration |
| HIGH | …me/triton_trtllm/model_repo/audio_tokenizer/1/model.py | 0 | triton python model for audio tokenization. this model takes reference audio input and extracts semantic tokens using s3 |
| HIGH | …/triton_trtllm/model_repo/speaker_embedding/1/model.py | 0 | triton python model for audio tokenization. this model takes reference audio input and extracts semantic tokens using s3 |
| HIGH | …rtllm/model_repo_cosyvoice3/audio_tokenizer/1/model.py | 0 | triton python model for audio tokenization. this model takes reference audio input and extracts semantic tokens using s3 |
| HIGH | …llm/model_repo_cosyvoice3/speaker_embedding/1/model.py | 0 | triton python model for audio tokenization. this model takes reference audio input and extracts semantic tokens using s3 |
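
Several rows in the table above quote a docstring about extracting speech ids from token strings like `<|s_23456|>`. The following is a minimal sketch of what such a helper might look like, not the repository's actual implementation; the function name `extract_speech_ids` and the regular expression are assumptions made for illustration.

```python
import re

# Hypothetical pattern (an assumption, not taken from the scanned files):
# matches tokens shaped like <|s_23456|> and captures the digits.
SPEECH_TOKEN_PATTERN = re.compile(r"<\|s_(\d+)\|>")


def extract_speech_ids(token_text: str) -> list[int]:
    """Return the integer id of every <|s_N|> token in token_text, in order."""
    return [int(digits) for digits in SPEECH_TOKEN_PATTERN.findall(token_text)]


if __name__ == "__main__":
    # Text between tokens is ignored; only well-formed tokens contribute ids.
    print(extract_speech_ids("<|s_23456|><|s_7|> noise <|s_100|>"))
```

Because three files carry the same docstring, a single shared helper of this kind is one way the duplicated logic could be consolidated.
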
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | webui.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Liu Yue) |
| LOW | cosyvoice/llm/llm.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | cosyvoice/bin/export_jit.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/bin/average_model.py | 1 | # Copyright (c) 2020 Mobvoi Inc (Di Wu) |
| LOW | cosyvoice/bin/train.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/bin/export_onnx.py | 1 | # Copyright (c) 2024 Antgroup Inc (authors: Zhoubofan, hexisyztem@icloud.com) |
| LOW | cosyvoice/dataset/processor.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/dataset/dataset.py | 1 | # Copyright (c) 2021 Mobvoi Inc. (authors: Binbin Zhang) |
| LOW | cosyvoice/hifigan/generator.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Kai Hu) |
| LOW | cosyvoice/hifigan/f0_predictor.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Kai Hu) |
| LOW | cosyvoice/transformer/decoder.py | 1 | # Copyright (c) 2021 Mobvoi Inc. (authors: Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/attention.py | 1 | # Copyright (c) 2019 Shigeki Karita |
| LOW | cosyvoice/transformer/attention.py | 161 | Returns: |
| LOW | cosyvoice/transformer/attention.py | 281 | # cache(1, head, 0, d_k * 2) (16/-1, -1/-1, 16/0 mode) |
| LOW | cosyvoice/transformer/subsampling.py | 1 | # Copyright (c) 2021 Mobvoi Inc (Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/convolution.py | 1 | # Copyright (c) 2020 Mobvoi Inc. (authors: Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/upsample_encoder.py | 1 | # Copyright (c) 2021 Mobvoi Inc (Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/embedding.py | 1 | # Copyright (c) 2020 Mobvoi Inc. (authors: Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/encoder_layer.py | 1 | # Copyright (c) 2021 Mobvoi Inc (Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/encoder.py | 1 | # Copyright (c) 2021 Mobvoi Inc (Binbin Zhang, Di Wu) |
| LOW | cosyvoice/transformer/activation.py | 1 | # Copyright (c) 2020 Johns Hopkins University (Shinji Watanabe) |
| LOW | cosyvoice/transformer/positionwise_feed_forward.py | 1 | # Copyright (c) 2019 Shigeki Karita |
| LOW | cosyvoice/transformer/decoder_layer.py | 1 | # Copyright (c) 2019 Shigeki Karita |
| LOW | cosyvoice/transformer/label_smoothing_loss.py | 1 | # Copyright (c) 2019 Shigeki Karita |
| LOW | cosyvoice/vllm/cosyvoice2.py | 1 | # SPDX-License-Identifier: Apache-2.0 |
| LOW | cosyvoice/utils/class_utils.py | 1 | # Copyright [2023-11-28] <sxc19@mails.tsinghua.edu.cn, Xingchen Song> |
| LOW | cosyvoice/utils/mask.py | 1 | # Copyright (c) 2019 Shigeki Karita |
| LOW | cosyvoice/utils/train_utils.py | 1 | # Copyright (c) 2021 Mobvoi Inc. (authors: Binbin Zhang) |
| LOW | cosyvoice/utils/common.py | 1 | # Copyright (c) 2020 Mobvoi Inc (Binbin Zhang) |
| LOW | cosyvoice/utils/file_utils.py | 1 | # Copyright (c) 2021 Mobvoi Inc. (authors: Binbin Zhang) |
| LOW | cosyvoice/utils/scheduler.py | 1 | # Copyright (c) 2020 Mobvoi Inc (Binbin Zhang) |
| LOW | cosyvoice/utils/frontend_utils.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | cosyvoice/utils/executor.py | 1 | # Copyright (c) 2020 Mobvoi Inc (Binbin Zhang) |
| LOW | cosyvoice/cli/frontend.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/cli/cosyvoice.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/cli/model.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | cosyvoice/flow/decoder.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | cosyvoice/flow/flow.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | cosyvoice/flow/length_regulator.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | cosyvoice/flow/flow_matching.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu, Zhihao Du) |
| LOW | tools/make_parquet_list.py | 1 | #!/usr/bin/env python3 |
| LOW | tools/extract_speech_token.py | 1 | #!/usr/bin/env python3 |
| LOW | tools/extract_embedding.py | 1 | #!/usr/bin/env python3 |
| LOW | runtime/python/grpc/server.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | runtime/python/grpc/client.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | runtime/python/fastapi/server.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | runtime/python/fastapi/client.py | 1 | # Copyright (c) 2024 Alibaba Inc (authors: Xiang Lyu) |
| LOW | runtime/triton_trtllm/token2wav_dit.py | 1 | # SPDX-FileCopyrightText: Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved. |
| LOW | runtime/triton_trtllm/run_stepaudio2_dit_token2wav.sh | 101 | if [ $stage -le 3 ] && [ $stop_stage -ge 3 ]; then |
| LOW | runtime/triton_trtllm/client_http.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | runtime/triton_trtllm/offline_inference.py | 1 | # SPDX-FileCopyrightText: Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved. |
| LOW | runtime/triton_trtllm/client_grpc.py | 1 | # Copyright 2022 Xiaomi Corp. (authors: Fangjun Kuang) |
| LOW | runtime/triton_trtllm/token2wav.py | 1 | # SPDX-FileCopyrightText: Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved. |
| LOW | runtime/triton_trtllm/model_repo/cosyvoice2/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | runtime/triton_trtllm/model_repo/token2wav/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | …me/triton_trtllm/model_repo/audio_tokenizer/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 1 | # SPDX-FileCopyrightText: Copyright (c) 2025, NVIDIA CORPORATION. All rights reserved. |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| LOW | …/triton_trtllm/model_repo/speaker_embedding/1/model.py | 1 | # Copyright 2025, NVIDIA CORPORATION & AFFILIATES. All rights reserved. |
| 10 more matches not shown… | | | |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | webui.py | 19 | |
| LOW | webui.py | 22 | |
| LOW | cosyvoice/llm/llm.py | 18 | |
| LOW | cosyvoice/llm/llm.py | 20 | |
| LOW | cosyvoice/bin/export_jit.py | 15 | |
| LOW | cosyvoice/bin/train.py | 15 | |
| LOW | cosyvoice/bin/export_onnx.py | 16 | |
| LOW | cosyvoice/vllm/cosyvoice2.py | 35 | |
| LOW | cosyvoice/flow/DiT/modules.py | 11 | |
| LOW | cosyvoice/flow/DiT/dit.py | 11 | |
| LOW | runtime/triton_trtllm/offline_inference.py | 34 | |
| LOW | runtime/triton_trtllm/offline_inference.py | 39 | |
| LOW | runtime/triton_trtllm/streaming_inference.py | 7 | |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 12 | |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 14 | |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 19 | |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 20 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 28 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 29 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 32 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 32 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 32 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 42 | |
| LOW | runtime/triton_trtllm/model_repo/token2wav/1/model.py | 40 | |
| LOW | …me/triton_trtllm/model_repo/audio_tokenizer/1/model.py | 33 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 28 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 31 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 31 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 35 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 39 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 40 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 41 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 41 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 42 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 43 | |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 44 | |
| LOW | …/triton_trtllm/model_repo/speaker_embedding/1/model.py | 33 | |
| LOW | …iton_trtllm/model_repo_cosyvoice3/token2wav/1/model.py | 7 | |
| LOW | …rtllm/model_repo_cosyvoice3/audio_tokenizer/1/model.py | 33 | |
| LOW | …llm/model_repo_cosyvoice3/speaker_embedding/1/model.py | 33 | |
| LOW | runtime/triton_trtllm/scripts/test_llm.py | 19 | |
| LOW | …time/triton_trtllm/scripts/convert_cosyvoice3_to_hf.py | 39 | |
| LOW | …time/triton_trtllm/scripts/convert_cosyvoice3_to_hf.py | 39 | |
| LOW | …time/triton_trtllm/scripts/convert_cosyvoice3_to_hf.py | 75 | |
| LOW | examples/grpo/cosyvoice2/prepare_data.py | 20 | |
| LOW | examples/grpo/cosyvoice2/infer_dataset.py | 30 | |
| LOW | examples/grpo/cosyvoice2/infer_dataset.py | 33 | |
| LOW | examples/grpo/cosyvoice2/infer_dataset.py | 37 | |
| LOW | examples/grpo/cosyvoice2/infer_dataset.py | 43 | |
| LOW | examples/grpo/cosyvoice2/pretrained_to_huggingface.py | 27 | |
| LOW | examples/grpo/cosyvoice2/token2wav_asr_server.py | 20 | |
| LOW | examples/grpo/cosyvoice2/token2wav_asr_server.py | 25 | |
| LOW | examples/grpo/cosyvoice2/token2wav_asr_server.py | 27 | |
| LOW | examples/grpo/cosyvoice2/token2wav_asr_server.py | 33 | |
| LOW | examples/grpo/cosyvoice2/reward_tts.py | 19 | |
| LOW | examples/grpo/cosyvoice2/reward_tts.py | 119 | |
| LOW | …amples/grpo/cosyvoice2/scripts/offline-decode-files.py | 87 | |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | cosyvoice/dataset/processor.py | 51 | except Exception as ex: |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 100 | print(f"An error occurred in the stream callback: {error}") |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 576 | print(f"Error: Audio file not found for item {i}: {item['audio_filepath']}") |
| LOW | runtime/triton_trtllm/client_grpc.py | 577 | except Exception as e: |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 578 | print(f"Error processing item {i} ({item['target_audio_path']}): {e}") |
| LOW | runtime/triton_trtllm/client_grpc.py | 588 | except Exception as e: |
| LOW | runtime/triton_trtllm/client_grpc.py | 742 | except Exception as e: |
| LOW | runtime/triton_trtllm/client_grpc.py | 903 | except Exception as e: |
| LOW | runtime/triton_trtllm/client_grpc.py | 910 | except Exception as e: |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 911 | print(f"Error closing async stats client: {e}") |
| LOW | runtime/triton_trtllm/client_grpc.py | 918 | except Exception as e: |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 919 | print(f"An error occurred in main: {e}") |
| MEDIUM | runtime/triton_trtllm/client_grpc.py | 915 | def run_main(): |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 427 | except Exception as e: |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 483 | except Exception as e: |
| LOW | runtime/triton_trtllm/scripts/convert_checkpoint.py | 293 | except Exception as e: |
| LOW | examples/grpo/cosyvoice2/reward_tts.py | 106 | except Exception as e: |
| LOW | examples/grpo/cosyvoice2/reward_tts.py | 212 | except Exception as e: |
| MEDIUM | examples/grpo/cosyvoice2/reward_tts.py | 213 | print(f"Error computing score: {e}") |
| MEDIUM | examples/grpo/cosyvoice2/reward_tts.py | 229 | print(f"Error: File not found - {args.input}") |
| LOW | examples/grpo/cosyvoice2/reward_tts.py | 232 | except Exception as e: |
| MEDIUM | examples/grpo/cosyvoice2/reward_tts.py | 233 | print(f"Error: {e}") |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | cosyvoice/utils/mask.py | 18 | def subsequent_mask( size: int, device: torch.device = torch.device("cpu"), ) -> torch.Tensor: """C |
| HIGH | cosyvoice/utils/mask.py | 57 | Create mask for subsequent steps (size, size). This mask is used only in decoder which works in an auto-regressive |
| HIGH | cosyvoice/utils/mask.py | 95 | Create mask for subsequent steps (size, size) with chunk size, this is for streaming encoder Args: s |
| HIGH | cosyvoice/utils/mask.py | 133 | Create mask for subsequent steps (size, size) with chunk size, this is for streaming encoder Args: s |
| HIGH | cosyvoice/utils/mask.py | 240 | Make mask tensor containing indices of padded part. See description of make_non_pad_mask. Args: length |
| HIGH | cosyvoice/utils/common.py | 57 | Perform padding for the list of tensors. Args: xs (List): List of Tensors [(T_1, `*`), (T_2, `*`), ..., (T_ |
| HIGH | cosyvoice/flow/decoder.py | 211 | Forward pass of the UNet1DConditional model. Args: x (torch.Tensor): shape (batch_size, in_channels |
| HIGH | cosyvoice/flow/decoder.py | 406 | Forward pass of the UNet1DConditional model. Args: x (torch.Tensor): shape (batch_size, in_channels |
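
The `cosyvoice/utils/mask.py` rows above describe two kinds of mask: a `(size, size)` mask for subsequent steps used in auto-regressive decoding, and a mask marking the padded part of each sequence. The sketch below illustrates both ideas with plain Python lists so it runs without dependencies; the scanned file works on `torch` tensors, and the conventions here (lower-triangular `True` for visible positions, `True` at padded positions) are assumptions based on the snippet wording.

```python
def subsequent_mask(size: int) -> list[list[bool]]:
    """Causal (size, size) mask: row i is True at columns j <= i.

    Assumption: True means "position i may attend to position j".
    """
    return [[j <= i for j in range(size)] for i in range(size)]


def make_pad_mask(lengths: list[int]) -> list[list[bool]]:
    """(batch, max_len) mask that is True where a position is padding.

    Assumption: sequences are right-padded up to the longest length.
    """
    max_len = max(lengths) if lengths else 0
    return [[t >= n for t in range(max_len)] for n in lengths]


if __name__ == "__main__":
    for row in subsequent_mask(3):
        print(row)
    for row in make_pad_mask([3, 1]):
        print(row)
```
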
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | webui.py | 50 | |
| LOW | cosyvoice/llm/llm.py | 302 | |
| LOW | cosyvoice/llm/llm.py | 505 | |
| LOW | cosyvoice/llm/llm.py | 552 | |
| LOW | cosyvoice/bin/average_model.py | 43 | |
| LOW | cosyvoice/dataset/processor.py | 31 | |
| LOW | cosyvoice/utils/mask.py | 161 | |
| LOW | cosyvoice/utils/train_utils.py | 111 | |
| LOW | cosyvoice/utils/train_utils.py | 291 | |
| LOW | cosyvoice/cli/model.py | 101 | |
| LOW | cosyvoice/cli/model.py | 175 | |
| LOW | cosyvoice/flow/decoder.py | 196 | |
| LOW | runtime/python/grpc/client.py | 30 | |
| LOW | runtime/triton_trtllm/token2wav_dit.py | 175 | |
| LOW | runtime/triton_trtllm/offline_inference.py | 325 | |
| LOW | runtime/triton_trtllm/client_grpc.py | 112 | |
| LOW | runtime/triton_trtllm/client_grpc.py | 159 | |
| LOW | runtime/triton_trtllm/client_grpc.py | 412 | |
| LOW | runtime/triton_trtllm/client_grpc.py | 501 | |
| LOW | runtime/triton_trtllm/client_grpc.py | 687 | |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 193 | |
| LOW | runtime/triton_trtllm/token2wav.py | 127 | |
| LOW | runtime/triton_trtllm/infer_cosyvoice3.py | 219 | |
| LOW | runtime/triton_trtllm/model_repo/cosyvoice2/1/model.py | 315 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 121 | |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 295 | |
| LOW | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 175 | |
| LOW | …/triton_trtllm/model_repo/speaker_embedding/1/model.py | 91 | |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 84 | |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 305 | |
| LOW | …llm/model_repo_cosyvoice3/speaker_embedding/1/model.py | 91 | |
| LOW | runtime/triton_trtllm/scripts/fill_template.py | 6 | |
| LOW | runtime/triton_trtllm/scripts/convert_checkpoint.py | 152 | |
| LOW | runtime/triton_trtllm/scripts/convert_checkpoint.py | 186 | |
| LOW | runtime/triton_trtllm/scripts/convert_checkpoint.py | 282 | |
| LOW | examples/grpo/cosyvoice2/infer_dataset.py | 282 | |
| LOW | …amples/grpo/cosyvoice2/scripts/offline-decode-files.py | 137 | |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | cosyvoice/transformer/decoder.py | 178 | def forward_layers_checkpointed(self, x: torch.Tensor, |
| LOW | cosyvoice/transformer/encoder.py | 173 | def forward_layers_checkpointed(self, xs: torch.Tensor, |
| LOW | cosyvoice/utils/mask.py | 89 | def subsequent_chunk_mask_deprecated( |
| LOW | cosyvoice/utils/train_utils.py | 53 | def init_dataset_and_dataloader(args, configs, gan, dpo): |
| LOW | cosyvoice/utils/train_utils.py | 72 | def check_modify_and_save_config(args, configs): |
| LOW | cosyvoice/utils/train_utils.py | 111 | def init_optimizer_and_scheduler(args, configs, model, gan): |
| LOW | cosyvoice/utils/scheduler.py | 397 | def _linear_warmup_with_cosine_annealing(max_lr, warmup_steps, step, |
| LOW | cosyvoice/utils/scheduler.py | 543 | def _get_linear_warmup_with_cosine_annealing_lr(self, step): |
| LOW | cosyvoice/cli/frontend.py | 89 | def _extract_text_token_generator(self, text_generator): |
| LOW | runtime/triton_trtllm/token2wav_dit.py | 237 | def get_trt_kwargs_dynamic_batch(self, opt_batch_size=2, max_batch_size=64, streaming=False): |
| LOW | runtime/triton_trtllm/token2wav_dit.py | 264 | def prompt_audio_tokenization(self, prompt_audios_list: list[torch.Tensor]) -> list[list[int]]: |
| LOW | runtime/triton_trtllm/token2wav_dit.py | 371 | def get_prompt_audio_cache_for_streaming_tts( |
| LOW | runtime/triton_trtllm/offline_inference.py | 63 | async def send_batch_requests_async(api_base, model_name, chats, temperature, top_p, top_k): |
| LOW | runtime/triton_trtllm/client_grpc.py | 359 | def prepare_request_input_output( |
| LOW | runtime/triton_trtllm/client_grpc.py | 412 | def run_sync_streaming_inference( |
| LOW | runtime/triton_trtllm/token2wav_cosyvoice3.py | 219 | def prompt_audio_tokenization(self, prompt_audios_list): |
| LOW | runtime/triton_trtllm/token2wav.py | 185 | def get_trt_kwargs_dynamic_batch(self, opt_bs=2, max_batch_size=64): |
| LOW | runtime/triton_trtllm/token2wav.py | 193 | def prompt_audio_tokenization(self, prompt_audios_list: list[torch.Tensor]) -> list[list[int]]: |
| LOW | runtime/triton_trtllm/infer_cosyvoice3.py | 45 | async def send_batch_requests_async(api_base, model_name, chats, temperature, top_p, top_k): |
| LOW | runtime/triton_trtllm/model_repo/cosyvoice2/1/model.py | 201 | def forward_speaker_embedding(self, wav): |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 52 | def parse_speech_token_string(response_text: str) -> List[int]: |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 108 | def _convert_speech_tokens_to_str(self, speech_tokens: Union[torch.Tensor, List]) -> str: |
| LOW | …ime/triton_trtllm/model_repo/cosyvoice2_dit/1/model.py | 207 | def forward_speaker_embedding(self, wav): |
| LOW | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 237 | def get_trt_kwargs_dynamic_batch(self, opt_batch_size=2, max_batch_size=64, streaming=False): |
| LOW | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 264 | def prompt_audio_tokenization(self, prompt_audios_list: list[torch.Tensor]) -> list[list[int]]: |
| LOW | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 371 | def get_prompt_audio_cache_for_streaming_tts( |
| LOW | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 56 | def get_spk_id_from_prompt_audio(tensor: torch.Tensor) -> str: |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 25 | def parse_speech_token_string(response_text): |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 72 | def _convert_speech_tokens_to_str(self, speech_tokens): |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 181 | def forward_speaker_embedding(self, wav): |
| LOW | …ton_trtllm/model_repo_cosyvoice3/cosyvoice3/1/model.py | 305 | async def _process_request_streaming(self, request): |
| LOW | runtime/triton_trtllm/scripts/convert_checkpoint.py | 186 | def update_quant_config_from_hf(quant_config, hf_config, |
| LOW | examples/grpo/cosyvoice2/token2wav_asr_server.py | 116 | def get_random_prompt_from_dataset(dataset): |
| LOW | …amples/grpo/cosyvoice2/scripts/offline-decode-files.py | 588 | def normalize_text_alimeeting(text: str) -> str: |
| Severity | File | Line | Snippet |
|---|---|---|---|
| HIGH | runtime/triton_trtllm/token2wav_dit.py | 17 | python3 token2wav.py --enable-trt \|\| exit 1 |
| HIGH | runtime/triton_trtllm/token2wav_cosyvoice3.py | 3 | python3 token2wav_cosyvoice3.py --enable-trt \|\| exit 1 |
| HIGH | runtime/triton_trtllm/token2wav.py | 17 | python3 token2wav.py --enable-trt \|\| exit 1 |
| HIGH | …ton_trtllm/model_repo/token2wav_dit/1/token2wav_dit.py | 17 | python3 token2wav.py --enable-trt \|\| exit 1 |
| Severity | File | Line | Snippet |
|---|---|---|---|
| LOW | cosyvoice/transformer/convolution.py | 125 | # It's better we just return None if no cache is required, |
| MEDIUM | cosyvoice/utils/executor.py | 45 | # A context manager to be used in conjunction with an instance of |
| MEDIUM | cosyvoice/utils/executor.py | 97 | # A context manager to be used in conjunction with an instance of |
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | …time/triton_trtllm/model_repo/token2wav_dit/1/model.py | 64 | # Create a SHA-256 hash of the byte string |
| MEDIUM | examples/grpo/cosyvoice2/run.sh | 9 | # This function is from espnet |
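
The first row above flags a comment about creating a SHA-256 hash of a byte string. As a rough illustration of that step only, the sketch below derives a stable identifier from raw bytes with the standard-library `hashlib` module; the function name `speaker_id_from_bytes` is hypothetical, and how the scanned file obtains the bytes from its input tensor is not shown in the report.

```python
import hashlib


def speaker_id_from_bytes(audio_bytes: bytes) -> str:
    """Return the SHA-256 hex digest of audio_bytes.

    The same bytes always yield the same 64-character id, so the digest can
    serve as a cache key for previously seen prompt audio (an assumed use).
    """
    return hashlib.sha256(audio_bytes).hexdigest()


if __name__ == "__main__":
    print(speaker_id_from_bytes(b"example prompt audio"))
```
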
| Severity | File | Line | Snippet |
|---|---|---|---|
| MEDIUM | example.py | 60 | # bistream usage, you can use generator as input, this is useful when using text llm model as input |