DeepLearning.AI
AI is the new electricity and will transform and improve nearly all areas of human lives.

๐Ÿ’ป ย  Accessing Utils File and Helper Functions

In each notebook on the top menu:

1: ย  Click on "File"

2: ย  Then, click on "Open"

You will be able to see all the notebook files for the lesson, including any helper functions used in the notebook on the left sidebar. See the following image for the steps above.


๐Ÿ’ป ย  Downloading Notebooks

In each notebook on the top menu:

1: ย  Click on "File"

2: ย  Then, click on "Download as"

3: ย  Then, click on "Notebook (.ipynb)"


๐Ÿ’ป ย  Uploading Your Files

After following the steps shown in the previous section ("File" => "Open"), then click on "Upload" button to upload your files.


๐Ÿ“— ย  See Your Progress

Once you enroll in this courseโ€”or any other short course on the DeepLearning.AI platformโ€”and open it, you can click on 'My Learning' at the top right corner of the desktop view. There, you will be able to see all the short courses you have enrolled in and your progress in each one.

Additionally, your progress in each short course is displayed at the bottom-left corner of the learning page for each course (desktop view).


๐Ÿ“ฑ ย  Features to Use

๐ŸŽž ย  Adjust Video Speed: Click on the gear icon (โš™) on the video and then from the Speed option, choose your desired video speed.

๐Ÿ—ฃ ย  Captions (English and Spanish): Click on the gear icon (โš™) on the video and then from the Captions option, choose to see the captions either in English or Spanish.

๐Ÿ”… ย  Video Quality: If you do not have access to high-speed internet, click on the gear icon (โš™) on the video and then from Quality, choose the quality that works the best for your Internet speed.

๐Ÿ–ฅ ย  Picture in Picture (PiP): This feature allows you to continue watching the video when you switch to another browser tab or window. Click on the small rectangle shape on the video to go to PiP mode.

โˆš ย  Hide and Unhide Lesson Navigation Menu: If you do not have a large screen, you may click on the small hamburger icon beside the title of the course to hide the left-side navigation menu. You can then unhide it by clicking on the same icon again.


๐Ÿง‘ ย  Efficient Learning Tips

The following tips can help you have an efficient learning experience with this short course and other courses.

๐Ÿง‘ ย  Create a Dedicated Study Space: Establish a quiet, organized workspace free from distractions. A dedicated learning environment can significantly improve concentration and overall learning efficiency.

๐Ÿ“… ย  Develop a Consistent Learning Schedule: Consistency is key to learning. Set out specific times in your day for study and make it a routine. Consistent study times help build a habit and improve information retention.

Tip: Set a recurring event and reminder in your calendar, with clear action items, to get regular notifications about your study plans and goals.

โ˜• ย  Take Regular Breaks: Include short breaks in your study sessions. The Pomodoro Technique, which involves studying for 25 minutes followed by a 5-minute break, can be particularly effective.

๐Ÿ’ฌ ย  Engage with the Community: Participate in forums, discussions, and group activities. Engaging with peers can provide additional insights, create a sense of community, and make learning more enjoyable.

โœ ย  Practice Active Learning: Don't just read or run notebooks or watch the material. Engage actively by taking notes, summarizing what you learn, teaching the concept to someone else, or applying the knowledge in your practical projects.


๐Ÿ“š ย  Enroll in Other Short Courses

Keep learning by enrolling in other short courses. We add new short courses regularly. Visit DeepLearning.AI Short Courses page to see our latest courses and begin learning new topics. ๐Ÿ‘‡

๐Ÿ‘‰๐Ÿ‘‰ ๐Ÿ”— DeepLearning.AI โ€“ All Short Courses [+]


๐Ÿ™‚ ย  Let Us Know What You Think

Your feedback helps us know what you liked and didn't like about the course. We read all your feedback and use them to improve this course and future courses. Please submit your feedback by clicking on "Course Feedback" option at the bottom of the lessons list menu (desktop view).

Also, you are more than welcome to join our community ๐Ÿ‘‰๐Ÿ‘‰ ๐Ÿ”— DeepLearning.AI Forum


Sign in

Create Your Account

Or, sign up with your email
Email Address

Already have an account? Sign in here!

By signing up, you agree to our Terms Of Use and Privacy Policy

Choose Your Learning Path

Enjoy 30% Off Now. Cancel Anytime!

MonthlyYearly

Change Your Plan

Your subscription plan will change at the end of your current billing period. Youโ€™ll continue to have access to your current plan until then.

View All Plans and Features

Welcome back!

Hi ,

We'd like to know you better so we can create more relevant courses. What do you do for work?

DeepLearning.AI
  • Explore Courses
  • Community
    • Forum
    • Events
    • Ambassadors
    • Ambassador Spotlight
  • My Learnings
  • daily streak fire

    You've achieved today's streak!

    Complete one lesson every day to keep the streak going.

    Su

    Mo

    Tu

    We

    Th

    Fr

    Sa

    free pass got

    You earned a Free Pass!

    Free Passes help protect your daily streak. Complete more lessons to earn up to 3 Free Passes.

    Free PassFree PassFree Pass
built in partnership with CircleCI. Software testing helps you identify bugs and security vulnerabilities in your applications, and automated testing frees up your time and energy further, so they can focus on the creative parts of designing and building your application. In this course, you learn modern software engineering practices focused on testing for the practical development and deployment of LLM-based applications. Two kinds of LLM evaluations that you implement in this course are rule-based evaluations and model-graded evaluations. Rule-based evals use string or pattern matching, for example, regular expression matching, and are fast and cost-effective to run. I use these whenever I want to evaluate outputs that have a clear right answer, such as sentiment classification and if, say, you have ground-truth labels. Rule-based evals are quick and cheap to run, so you can run these tests every time you commit a code change to get fast feedback on the health of your application. Model graded evaluations are relevant for applications where there are many possible good or bad outputs. For example, if you ask an LLM to write text content for you, there can be more than one high-quality response. Here, you might prompt an evaluation LLM to have it assess the quality of the output of your application LLM. In other words, use an LLM to evaluate the output of another LLM. Model-graded evals take more time and cost more, but they allow you to assess more complex outputs. I'm delighted to introduce our instructor for this course, Rob Zuber, Chief Technology Officer for CircleCI. Rob has spent decades leading engineering teams and also helping customers scale up their software delivery practice by making processes repeatable, scalable, and reliable. He'll show you how to do this for your applications as well with an emphasis on testing. Thanks, Andrew. In your software development process, you and your teammates may commit code updates or bug fixes multiple times per day. In this course, you'll learn to set triggers that automatically run your evaluations whenever you or your teammates commit code changes to the repository. Your team may also release updated versions of the app on a broader cadence, perhaps once every two weeks. Before deploying to users, you can also automate more holistic, comprehensive, pre-release evaluations. For per-commit evals, you can include rules-based evaluations because they're fast and cheap to run. And for those pre-release evals, you can include rules-based evaluations because they're fast and cheap to run. And for those pre-release evals, it may be very helpful to use model-graded evals to do more thorough testing before deployment. By the end of the course, you will combine per-commit and pre-release evals into an automated testing suite. And for this course, you'll design tests to detect hallucinations in LLM responses. Many people have worked to make this course possible. I'd like to thank on the CircleCI side, Michael Webster, Jacob Schmidt, and Emma Webb. From DeepLearning.ai, Eddie Hsu and Eshmal Gargari have also contributed to this course. The first lesson will be a quick overview of continuous integration terms and technologies that we'll use as the foundation for building our automated LLM testing pipeline. When you finish this course, that will be a real testament to your dedication to building good applications. Or if you're not sure how much you'll use these ideas, you can still test the waters. So let's go on to the next video to get started.
course detail
Next Lesson
Automated Testing for LLMOps
  • Introduction
    Video
    ใƒป
    3 mins
  • Introduction to Continuous Integration (CI)
    Video
    ใƒป
    4 mins
  • Overview of Automated Evals
    Video with Code Example
    ใƒป
    23 mins
  • Automating Model-Graded Evals
    Video with Code Example
    ใƒป
    7 mins
  • Comprehensive Testing Framework
    Video with Code Example
    ใƒป
    12 mins
  • Conclusion
    Video
    ใƒป
    1 min
  • Optional: Exploring the CircleCI config file
    Code Example
    ใƒป
    10 mins
  • Course Feedback
  • Community