Running GenAI on Intel AI Laptops and Simple LLM Inference on CPU and Fine-Tuning of LLM Models using Intel® OpenVINO™
-> Fine-tuned the Llama2-7b model on a custom Intel Products and Services FAQ dataset.
-> Converted it to the OpenVINO IR format for optimized inference, making it 56% faster than the original model.
📺 Demo (YouTube)
The dataset was prepared by scraping Intel Products and Services information from Intel's FAQ and help websites. The model's capability is limited to this dataset, which covers the Intel products listed below; a sketch of how the scraped Q&A pairs might be formatted for fine-tuning follows the list.
- 🚀 Intel Gaudi
- 🔧 POP Intel
- ⚡ Intel Optane
- 🛠️ IPP Intel
- 🔗 Intel MPI Library
- 🧠 Intel OpenVINO
- 🛡️ Product Support FAQ
- 📦 Product Installation FAQ
- 🌐 General Intel Information
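The project's actual scraping and preprocessing scripts are not reproduced here. As a rough illustration of the preparation step, the sketch below arranges scraped question-answer pairs into the Llama-2 `[INST] ... [/INST]` template that the model expects at inference time; the file names and field names are assumptions, not the project's code.

```python
# Hypothetical sketch: formatting scraped FAQ question-answer pairs into the
# Llama-2 instruction template used by this project. File and field names
# ("intel_faq_scraped.json", "question", "answer") are assumptions.
import json

def format_example(question: str, answer: str) -> str:
    # Mirrors the [INST] ... [/INST] prompt format used at inference time below
    return f"[INST] {question.strip()} [/INST] {answer.strip()}"

with open("intel_faq_scraped.json") as f:  # assumed file of scraped Q&A pairs
    records = json.load(f)

with open("intel_faq_train.jsonl", "w") as f:
    for r in records:
        f.write(json.dumps({"text": format_example(r["question"], r["answer"])}) + "\n")
```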
- Install the packages required to use the Optimum Intel integration with the OpenVINO backend:
pip install "optimum[openvino]"
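The published checkpoint `OjasPatil/intel-llama2-7b-ov` is already in OpenVINO IR format, so no conversion is needed to follow the steps below. For reference, converting a fine-tuned PyTorch checkpoint to IR with Optimum Intel could look roughly like this sketch; the local model path is an assumption, not the project's actual layout.

```python
# Sketch (not the authors' exact script): exporting a fine-tuned Hugging Face
# checkpoint to OpenVINO IR format using Optimum Intel.
from optimum.intel.openvino import OVModelForCausalLM

# export=True converts the PyTorch checkpoint to OpenVINO IR on load
ov_model = OVModelForCausalLM.from_pretrained(
    "path/to/fine-tuned-llama2-7b",  # assumed local path to the fine-tuned model
    export=True,
)
ov_model.save_pretrained("intel-llama2-7b-ov")  # writes the IR (.xml/.bin) files
```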
- Import and initialize the model from Hugging Face:
from transformers import AutoTokenizer
from optimum.intel.openvino import OVModelForCausalLM

# Load the tokenizer and the OpenVINO-optimized fine-tuned model from the Hugging Face Hub
model_name = "OjasPatil/intel-llama2-7b-ov"
tokenizer = AutoTokenizer.from_pretrained(model_name)
base_model = OVModelForCausalLM.from_pretrained(model_name)
- Perform inference with the OpenVINO-optimized fine-tuned Intel Virtual Assistant:
# Wrap the user question in the Llama-2 instruction template
message = "What is Intel OpenVINO?"
prompt = f"[INST] {message} [/INST]"

# Tokenize, generate, and strip the echoed prompt from the decoded output
inputs = tokenizer(prompt, return_tensors="pt")
outputs = base_model.generate(**inputs, max_new_tokens=50)
response = tokenizer.decode(outputs[0], skip_special_tokens=True).replace(prompt + " ", "")
print(response)
The OpenVINO IR format model runs 56% faster than the original model.
Figure: Performance comparison between the OpenVINO IR Format Model and the original model.
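As a rough guide only, a latency comparison of this kind could be reproduced along the lines of the sketch below. The baseline model id, prompt, token budget, and timing method are assumptions, not the authors' exact benchmark setup.

```python
# Sketch of a simple latency comparison between the OpenVINO IR model and an
# assumed PyTorch baseline; not the project's actual benchmark.
import time
from transformers import AutoModelForCausalLM, AutoTokenizer
from optimum.intel.openvino import OVModelForCausalLM

ov_name = "OjasPatil/intel-llama2-7b-ov"
baseline_name = "meta-llama/Llama-2-7b-chat-hf"  # assumed original model

tokenizer = AutoTokenizer.from_pretrained(ov_name)
inputs = tokenizer("[INST] What is Intel OpenVINO? [/INST]", return_tensors="pt")

def time_generate(model, n_tokens=50):
    # Wall-clock time for a single generation of n_tokens new tokens
    start = time.perf_counter()
    model.generate(**inputs, max_new_tokens=n_tokens)
    return time.perf_counter() - start

ov_time = time_generate(OVModelForCausalLM.from_pretrained(ov_name))
hf_time = time_generate(AutoModelForCausalLM.from_pretrained(baseline_name))

print(f"OpenVINO IR: {ov_time:.1f} s, original: {hf_time:.1f} s, "
      f"latency reduction: {(hf_time - ov_time) / hf_time:.0%}")
```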
The output quality of the model is also evaluated using ROUGE scores:
-> ROUGE-1: 35.23
-> ROUGE-2: 18.97
-> ROUGE-L: 28.82
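These figures come from the project's own evaluation. Purely as an illustration of how such scores can be computed, the sketch below uses the Hugging Face `evaluate` library; the prediction and reference strings are placeholders, not the project's evaluation set.

```python
# Illustrative ROUGE computation with the Hugging Face `evaluate` library.
# The prediction/reference pairs here are placeholders, not the real eval data.
import evaluate

rouge = evaluate.load("rouge")

predictions = ["OpenVINO is an Intel toolkit for optimizing and deploying AI inference."]
references = ["Intel OpenVINO is a toolkit for optimizing and deploying deep learning inference."]

scores = rouge.compute(predictions=predictions, references=references)
# Scale to 0-100 to match the figures reported above
print({k: round(v * 100, 2) for k, v in scores.items()})
```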
Demo Video Link: Project Demo
- Harinee J
- Mhanjhusriee Baskar
- Amit Das
- Ojas Patil