Check out this video demo to see Voice GPT in action (this video has audio, so turn it on):
vgpt.mp4
Voice GPT is a voice-enabled version of ChatGPT. It uses GPT (Generative Pre-trained Transformer) technology to provide a voice-powered conversational experience: it recognizes and responds to voice commands and provides customized responses in real time.
Voice GPT is perfect for people who want to have a natural and intuitive conversation with a virtual assistant without typing a single word. With Voice GPT, you can ask for directions, book a table at a restaurant, or even make a reservation at a hotel, all by simply speaking out loud. Voice GPT takes care of the rest, providing you with accurate and relevant responses.
- Voice-Enabled: Unlike ChatGPT, Voice GPT allows users to use voice input to communicate with the application.
- Custom Responses: Voice GPT provides users with customized responses that are tailored to their specific needs.
- Multi-Lingual: Voice GPT supports multiple languages, making it accessible to users around the world.
- Contextual Awareness: Voice GPT uses context-awareness to understand the meaning behind users' requests and provide relevant responses.
- Secure and Private: Voice GPT uses industry-standard security measures to protect users' data and privacy.
Voice GPT is built using a range of cutting-edge technologies, including:
- OpenAI API: The OpenAI API offers a powerful set of tools for building cutting-edge AI-powered applications, including natural language processing, computer vision, and more, with state-of-the-art pre-trained models and advanced customization options.
- Next.js: Voice GPT is built using Next.js, which offers server-side rendering, static site generation, automatic code splitting, hot module replacement, file system routing, built-in CSS support, API routes, and TypeScript support, making it a versatile and powerful framework for building modern web applications.
- React: The front end of the application is built using React (through Next.js), a popular modern JavaScript library for building user interfaces.
- Firebase: Firebase offers a comprehensive suite of tools and services for building scalable and secure mobile and web applications with real-time data synchronization, serverless functions, authentication, and more.
- Web Speech API: The Web Speech API's `webkitSpeechRecognition` interface lets developers add voice recognition to their web applications, providing a more natural and intuitive user experience for uses such as voice commands, dictation, and transcription.
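As a rough illustration of the Web Speech API item above, the following is a minimal sketch of wiring up speech recognition. It is not taken from the Voice GPT source: the names `createRecognizer` and `finalTranscript`, the `scope` parameter, and the hand-written interfaces are assumptions made for this example. Only `SpeechRecognition`/`webkitSpeechRecognition`, `lang`, `continuous`, `interimResults`, `onresult`, `start()`, and `stop()` come from the browser API.

```typescript
// Minimal structural types for the parts of the recognition API used here.
// They are declared by hand (an assumption of this sketch) so that the example
// does not depend on any particular DOM typings being available.
interface RecognitionAlternative { transcript: string }
interface RecognitionResult { isFinal: boolean; 0: RecognitionAlternative }
interface RecognitionEvent { resultIndex: number; results: ArrayLike<RecognitionResult> }
interface Recognition {
  lang: string;
  continuous: boolean;
  interimResults: boolean;
  onresult: ((event: RecognitionEvent) => void) | null;
  start(): void;
  stop(): void;
}
type RecognitionCtor = new () => Recognition;

// Concatenate the final (non-interim) transcripts carried by one result event.
function finalTranscript(event: RecognitionEvent): string {
  let text = "";
  for (let i = event.resultIndex; i < event.results.length; i++) {
    const result = event.results[i];
    if (result.isFinal) text += result[0].transcript;
  }
  return text.trim();
}

// Hypothetical helper: returns a configured recognizer, or null when the
// environment (for example a browser without the API) offers no constructor.
function createRecognizer(
  scope: Record<string, unknown>,
  onText: (text: string) => void,
  lang = "en-US",
): Recognition | null {
  const Ctor = (scope.SpeechRecognition ?? scope.webkitSpeechRecognition) as
    | RecognitionCtor
    | undefined;
  if (!Ctor) return null;

  const recognition = new Ctor();
  recognition.lang = lang;
  recognition.continuous = false; // stop after a single utterance
  recognition.interimResults = false; // only deliver final results
  recognition.onresult = (event) => {
    const text = finalTranscript(event);
    if (text) onText(text);
  };
  return recognition;
}
```

In a browser, this could be called as `createRecognizer(window as unknown as Record<string, unknown>, handleText)` followed by `start()` on the returned object, where `handleText` is whatever function forwards the transcript to the chat logic. Passing the scope in explicitly keeps the helper safe to import during server-side rendering, where `window` does not exist.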
To get started with Voice GPT and try it on your device, follow the instructions below:
- Clone the GitHub repo.
- Install all dependencies using `npm install`.
- Create a `.env.local` file in the root directory.
- Set up a Firebase DB (you will need to know how to set up a Firebase DB before moving ahead).
- Create a Firebase Database and add its details to the `.env.local` file. You will need a `FIREBASE_APIKEY`, and you must update the `firebase.ts` file with your own `firebaseConfig` before moving forward.
- Get a `FIREBASE_ADMIN_SERVICE_ACCOUNT_KEY` and add it to the `.env.local` file.
- For Google Authentication, get a `GOOGLE_ID` and `GOOGLE_SECRET` and add these values to the `.env.local` file.
- Set up the OpenAI API key. Get the OpenAI API key and add it as `OPEN_AI_KEY` in the `.env.local` file.
- Search the code for `process.env` and ensure the environment variables in the `.env.local` file have the same names as the `process.env` variables.
- After adding all the environment variables, run the application using `npm run dev`.
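Since a mistyped or missing variable name is an easy mistake in the steps above, a small check like the following can help. This is a sketch and not part of the repository: the helper name `missingEnvVars` is hypothetical, and the variable list simply mirrors the names mentioned in the steps, so adjust it if the `process.env` references in the code use different names.

```typescript
// Variable names as listed in the setup steps; treat this list as an
// assumption and keep it in sync with the process.env usages in the code.
const REQUIRED_ENV_VARS = [
  "FIREBASE_APIKEY",
  "FIREBASE_ADMIN_SERVICE_ACCOUNT_KEY",
  "GOOGLE_ID",
  "GOOGLE_SECRET",
  "OPEN_AI_KEY",
] as const;

// Hypothetical helper: returns the required names that are unset or blank
// in the given environment object, in the order they are listed above.
function missingEnvVars(env: Record<string, string | undefined>): string[] {
  return REQUIRED_ENV_VARS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}
```

On the server side, calling `missingEnvVars(process.env)` at startup and logging or throwing when the result is non-empty surfaces configuration problems before the first request fails.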