Python PyQt Convert Speech Recognition CodeSource

Pocket TTS: High-Quality Local Voice Cloning Without GPU

Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...

IEEE

Convolution-Augmented Transformers for Enhanced Speaker-Independent Dysarthric Speech Recognition

Abstract: Dysarthria is a motor speech disorder characterized by muscle movement difficulties that complicate verbal communication. It poses significant challenges to Automatic Speech Recognition (ASR ...

GitHub

SSMD - Speech Synthesis Markdown

SSMD (Speech Synthesis Markdown) is a lightweight Python library that provides a human-friendly markdown-like syntax for creating SSML (Speech Synthesis Markup Language) documents. It's designed to ...

GitHub

Baloon is a command-line tool and Python library for converting between geospatial vector formats. Convert BLN, Shapefile, GeoJSON, KML, GeoPackage, and SVG files quickly and ...

The output path must be a directory; it will be created if it does not exist. Input formats are auto-detected per file. Unsupported formats and write-only inputs (e.g., SVG) are skipped with a warning ...

Frontiers

A knowledge-driven framework for surgical safety check integration using speech recognition and speaker verification

1 Air Force Medical University Tangdu Hospital, Xi’an, China 2 Department of Gastroenterology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi’an, China Background: The WHO Surgical ...

IEEE

A Lightweight Forward–Backward Independent Temporal-Aware Causal Network for Speech Emotion Recognition

Abstract: Speech Emotion Recognition (SER) technology analyzes speech characteristics in human-computer interactions to understand user intent and improve interaction experience. It is widely used in ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results