
Challenge 05 - Responsible AI

< Previous Challenge - Home

Pre-requisites

  • Cognitive Search Index implementation from Challenge 4
  • Install the libraries listed in requirements.txt via pip install -r requirements.txt if you have not already done so.

Introduction

As LLMs grow in popularity and use around the world, the need to manage and monitor their outputs becomes increasingly important. In this challenge, you will learn how to evaluate the outputs of LLMs and how to identify and mitigate potential biases in the model.

Description

Questions you should be able to answer by the end of this challenge:

  • What services and tools can help identify and evaluate harms and data leakage in LLMs? (A hedged PII-detection sketch follows this list.)
  • How can you evaluate truthfulness and reduce hallucinations?
  • How can you evaluate a model when you don't have a ground-truth dataset for comparison?
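
One common building block for the first question is running model inputs or outputs through a PII recognition service before they are stored or returned. The following is a minimal sketch only, assuming the Azure AI Language service and the azure-ai-textanalytics package; the endpoint, key, and sample text are placeholders, and the notebook may use a different service or pattern.

```python
# Minimal PII-detection sketch (assumes Azure AI Language + azure-ai-textanalytics).
# Endpoint, key, and sample text are placeholders, not values from this challenge.
from azure.ai.textanalytics import TextAnalyticsClient
from azure.core.credentials import AzureKeyCredential

client = TextAnalyticsClient(
    endpoint="https://<your-language-resource>.cognitiveservices.azure.com/",
    credential=AzureKeyCredential("<your-key>"),
)

documents = ["My name is Jane Doe and my phone number is 555-0100."]
response = client.recognize_pii_entities(documents, language="en")

for doc in response:
    if doc.is_error:
        continue
    print("Redacted:", doc.redacted_text)  # original text with PII masked
    for entity in doc.entities:
        print(entity.category, entity.text, entity.confidence_score)
```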

Sections in this Challenge:

  1. Identifying harms and detecting Personally Identifiable Information (PII)
  2. Evaluating truthfulness using Ground-Truth Datasets
  3. Evaluating truthfulness using GPT without Ground-Truth Datasets (a hedged sketch of both evaluation patterns follows this list)
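
As a preview of sections 2 and 3, the sketch below illustrates two common evaluation patterns: comparing a model answer against a ground-truth answer with a simple token-overlap F1 score, and asking a GPT model to grade an answer when no ground truth exists. The AzureOpenAI deployment name, endpoint, API version, and prompt wording are assumptions for illustration; the notebook's actual approach may differ.

```python
# Two hedged evaluation sketches; names and prompts are illustrative, not the notebook's exact code.
from collections import Counter
from openai import AzureOpenAI


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a model answer and a ground-truth answer."""
    pred, ref = prediction.lower().split(), reference.lower().split()
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(pred), overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def gpt_judge(question: str, answer: str, client: AzureOpenAI, deployment: str) -> str:
    """Ask a GPT deployment to grade an answer when no ground-truth dataset exists."""
    prompt = (
        "Rate the following answer for factual accuracy and relevance to the question "
        "on a scale of 1-5, and briefly justify the score.\n"
        f"Question: {question}\nAnswer: {answer}"
    )
    response = client.chat.completions.create(
        model=deployment,  # your Azure OpenAI deployment name (placeholder)
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return response.choices[0].message.content


# Example usage (endpoint, key, and deployment name are placeholders):
# client = AzureOpenAI(azure_endpoint="https://<resource>.openai.azure.com/",
#                      api_key="<your-key>", api_version="2024-02-01")
# print(token_f1("Paris is the capital of France", "The capital of France is Paris"))
# print(gpt_judge("What is the capital of France?", "Paris", client, "gpt-4"))
```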

You will run the following Jupyter notebook for this challenge. It can be found in your Codespace under the notebooks folder. If you are working locally or in the Cloud, you can find it in the /Notebooks folder of the Resources.zip file.

  • CH-05-ResponsibleAI.ipynb

Success Criteria

To complete this challenge successfully, you should be able to:

  • Articulate Responsible AI principles when working with OpenAI models
  • Demonstrate methods and approaches for evaluating LLMs
  • Identify tools available to detect and mitigate harms in LLMs

Additional Resources