Projects with this topic
-
🚛 ✈️ An advanced Streamlit dashboard designed as an AI-powered assistant for World Movers Phils Inc. This application leverages Google Gemini for multimodal interactions, enabling users to get information, request quotes, marketing, analyze documents/images, use voice commands, and more, all within a custom-themed interface.
-
A PWA that provides chat-like messaging through REST API integrations with the Rhea Generative Framework.
The Rhea client app is a component of the Rhea Generative Framework.
It was originally intended as a way to demonstrate functionality found within the Rhea Generative Framework, and was designed both to demonstrate that functionality and to provide a foundation for building other components such as live chat (embeddable, etc.).
It has evolved over time to include additional functions for demonstration:
Persona management (role activation and management).
Speech-to-text and text-to-speech (browser-independent; both are part of Rhea's Generative Framework server-side components and available for local hosting).
STT captures audio for x seconds, transcribes it, and offers to continue transcribing, send the result as a message, or edit it manually.
-
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation
-
This project provides a client package and example scripts for TypeScript to access the alphaspeech pro ASR stream API.
-
This project provides a client package and example scripts for Python to access the alphaspeech pro ASR APIs.
-
Oremi Izwi is a Text-to-Speech (TTS) application based on Piper. It's designed to convert text into natural-sounding speech, enabling seamless integration of high-quality text-to-speech capabilities into applications. https://demsking.gitlab.io/oremi-izwi
-
The Python script utilizes the win32com library to interact with the Windows Speech API (SAPI), prompting the user to input text to be spoken aloud. It continuously speaks the input text using the default system text-to-speech engine until the user inputs "-1" to terminate the program.
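
The loop described above could look roughly like the following. This is a minimal sketch, not the project's actual code: the function names (`make_sapi_speaker`, `speak_until_sentinel`, `prompt_lines`) are hypothetical, and the speaker is passed in as a parameter so that the loop logic can be exercised without Windows or pywin32.

```python
from typing import Callable, Iterable, Iterator

SENTINEL = "-1"  # per the description, entering "-1" ends the program


def make_sapi_speaker() -> Callable[[str], None]:
    """Return a function that speaks text with the default SAPI voice (Windows only)."""
    # pywin32 is imported lazily so the rest of this sketch runs on any platform.
    import win32com.client

    voice = win32com.client.Dispatch("SAPI.SpVoice")
    return lambda text: voice.Speak(text)


def prompt_lines(prompt: str = "Text to speak (-1 to quit): ") -> Iterator[str]:
    """Yield user input forever; the caller decides when to stop."""
    while True:
        yield input(prompt)


def speak_until_sentinel(lines: Iterable[str], speak: Callable[[str], None]) -> int:
    """Speak each non-empty line until the sentinel appears; return the count spoken."""
    spoken = 0
    for line in lines:
        text = line.strip()
        if text == SENTINEL:
            break
        if text:  # assumption: blank input is skipped rather than spoken
            speak(text)
            spoken += 1
    return spoken
```

On Windows with pywin32 installed, `speak_until_sentinel(prompt_lines(), make_sapi_speaker())` would run the interactive loop; on other platforms any callable that accepts a string can stand in for the speaker.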
-
Simple module that helps you create a read-aloud report with numeric values.
-
Talking Calendar is a desktop calendar for Linux which has some speech capability. It has been developed using C and GTK4 and uses a built-in diphone speech synthesizer.
-
Python package to normalize text for speech-language models using different libraries.
-
An easy-to-use wrapper for the BARK text-to-speech engine
-
Gtts4j (Google Text-to-Speech for Java). Converts text to speech using Google Translate results, returning an MP3 file or the audio data for further manipulation. Translation via Google Translate is also integrated.
-
A Go app that fetches news via RSS/Atom feeds, reads the news aloud to you, and keeps track of which items you've listened to. Some NLP and other stuff is buried in here too.
-
Convert Russian text provided in a text file to an audio file using Google's text-to-speech API.
-
This project contains the code for the talk "Introduction to Python for Technical Communicators", given at tekom 2019 in Stuttgart.
-
Text-to-Speech Tablet Application (AAC for the Speech Impaired)