NextJS 14 Server Actions with Azure Whisper AI and Azure OpenAI

Project Overview

This project is a Next.js 14 with TypeScript application that leverages Server Actions, Azure Whisper AI, Azure OpenAI, MediaStream API, and Web Speech AI to create a virtual assistant similar to Siri 2.0. The application transforms voice input into text using Azure Whisper AI, processes the text to generate a response using Azure OpenAI, and then converts the response into audio using the MediaStream API and Web Speech AI. The project also utilizes Tailwind CSS for styling.

Features

Voice input transformation using Azure Whisper AI.
Text processing and response generation using Azure OpenAI.
Utilized MediaStream API for capturing audio input.
Audio response using Web Speech AI.

Setup

Clone the repository to your local machine:

git clone [email protected]:FabioDiCeglie/Siri-2.0.git

Navigate to the project directory:
Install dependencies:
```
npm install
```
Set up Azure services:
- Create accounts for Azure Whisper AI and Azure OpenAI if you haven't already.
- Obtain the necessary API keys and configure them in the project.
- Follow Azure documentation for guidance on how to set up and obtain API keys.
- Create a .env.local file in the project directory.
- Add the following lines to the .env file:
```
AZURE_API_KEY='your_azure_api_key'
AZURE_ENDPOINT='your_azure_endpoint'
AZURE_DEPLOYMENT_NAME='your_azure_deployment_name'
AZURE_DEPLOYMENT_COMPLETIONS_NAME='your_azure_deployment_completions_name'
```
Replace 'your_azure_api_key', 'your_azure_endpoint', 'your_azure_deployment_name', and 'your_azure_deployment_completions_name' with the corresponding values provided by Azure.
Start the development server:
```
npm run dev
```
Access the application in your browser:
```
http://localhost:3000
```

Usage

Click on the microphone icon to activate voice input.
Speak your command or question clearly into the microphone.
Wait for the application to process your input and generate a response.
Listen to the response.

Deployment

This project is deployed with Vercel. You can access the live version of the application here.

Technologies Used

Next.js 14
TypeScript
Azure Whisper AI
Azure OpenAI
MediaStream API
Web Speech API
Tailwind CSS

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
actions		actions
app		app
components		components
lib		lib
public/img		public/img
utils		utils
.env.local.example		.env.local.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
components.json		components.json
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NextJS 14 Server Actions with Azure Whisper AI and Azure OpenAI

Project Overview

Features

Setup

Usage

Deployment

Technologies Used

About

Releases

Packages

Languages

FabioDiCeglie/Siri-2.0

Folders and files

Latest commit

History

Repository files navigation

NextJS 14 Server Actions with Azure Whisper AI and Azure OpenAI

Project Overview

Features

Setup

Usage

Deployment

Technologies Used

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages