AI is the new electricity and will transform and improve nearly all areas of human lives.

Quick Guide & Tips

💻 Accessing Utils File and Helper Functions

In each notebook on the top menu:

1: Click on "File"

2: Then, click on "Open"

You will be able to see all the notebook files for the lesson, including any helper functions used in the notebook on the left sidebar. See the following image for the steps above.

🔄 Reset User Workspace

If you need to reset your workspace to its original state, follow these quick steps:

1: Access the Menu: Look for the three-dot menu (⋮) in the top-right corner of the notebook toolbar.

2: Restore Original Version: Click on "Restore Original Version" from the dropdown menu.

For more detailed instructions, please visit our Reset Workspace Guide.

💻 Downloading Notebooks

In each notebook on the top menu:

1: Click on "File"

2: Then, click on "Download as"

3: Then, click on "Notebook (.ipynb)"

💻 Uploading Your Files

After following the steps shown in the previous section ("File" => "Open"), then click on "Upload" button to upload your files.

📗 See Your Progress

Once you enroll in this course—or any other short course on the DeepLearning.AI platform—and open it, you can click on 'My Learning' at the top right corner of the desktop view. There, you will be able to see all the short courses you have enrolled in and your progress in each one.

Additionally, your progress in each short course is displayed at the bottom-left corner of the learning page for each course (desktop view).

📱 Features to Use

🎞 Adjust Video Speed: Click on the gear icon (⚙) on the video and then from the Speed option, choose your desired video speed.

🗣 Captions (English and Spanish): Click on the gear icon (⚙) on the video and then from the Captions option, choose to see the captions either in English or Spanish.

🔅 Video Quality: If you do not have access to high-speed internet, click on the gear icon (⚙) on the video and then from Quality, choose the quality that works the best for your Internet speed.

🖥 Picture in Picture (PiP): This feature allows you to continue watching the video when you switch to another browser tab or window. Click on the small rectangle shape on the video to go to PiP mode.

√ Hide and Unhide Lesson Navigation Menu: If you do not have a large screen, you may click on the small hamburger icon beside the title of the course to hide the left-side navigation menu. You can then unhide it by clicking on the same icon again.

🧑 Efficient Learning Tips

The following tips can help you have an efficient learning experience with this short course and other courses.

🧑 Create a Dedicated Study Space: Establish a quiet, organized workspace free from distractions. A dedicated learning environment can significantly improve concentration and overall learning efficiency.

📅 Develop a Consistent Learning Schedule: Consistency is key to learning. Set out specific times in your day for study and make it a routine. Consistent study times help build a habit and improve information retention.

Tip: Set a recurring event and reminder in your calendar, with clear action items, to get regular notifications about your study plans and goals.

☕ Take Regular Breaks: Include short breaks in your study sessions. The Pomodoro Technique, which involves studying for 25 minutes followed by a 5-minute break, can be particularly effective.

💬 Engage with the Community: Participate in forums, discussions, and group activities. Engaging with peers can provide additional insights, create a sense of community, and make learning more enjoyable.

✍ Practice Active Learning: Don't just read or run notebooks or watch the material. Engage actively by taking notes, summarizing what you learn, teaching the concept to someone else, or applying the knowledge in your practical projects.

📚 Enroll in Other Short Courses

Keep learning by enrolling in other short courses. We add new short courses regularly. Visit DeepLearning.AI Short Courses page to see our latest courses and begin learning new topics. 👇

👉👉 🔗 DeepLearning.AI – All Short Courses [+]

🙂 Let Us Know What You Think

Your feedback helps us know what you liked and didn't like about the course. We read all your feedback and use them to improve this course and future courses. Please submit your feedback by clicking on "Course Feedback" option at the bottom of the lessons list menu (desktop view).

Also, you are more than welcome to join our community 👉👉 🔗 DeepLearning.AI Forum

Sign in

Or, sign in with your email

Email

Password

Forgot password?

Don't have an account? Create account

By signing up, you agree to our Terms Of Use and Privacy Policy

Create Your Account

Or, sign up with your email

Email Address

Already have an account? Sign in here!

By signing up, you agree to our Terms Of Use and Privacy Policy

Choose Your Plan

Planning for more users?

What best describes you?

This helps us tune the catalog to suit you best.

Software Engineer

Data Scientist

Machine Learning Engineer

Data Analyst

Product Manager

Entrepreneur

Business / Consulting

Research / Academic

Student

Other

Subscribe to receive AI news, events and course updates from DeepLearning.AI!

Join Team Success

You have successfully joined undefined

You now have access to all Pro features. Click below to start learning!

Session Expired

Session expired — please return to Cornerstone to restart the session and complete the course.

/

Open Source Models with Hugging Face

All Courses

/

Open Source Models with Hugging Face

All Courses

Open Source Models with Hugging Face

Open Source Models with Hugging Face

Course Syllabus

Elevate Your Career with Full Learning Experience

Unlock Plus AI learning and gain exclusive insights from industry leaders

Access exclusive features like graded notebooks and quizzes

Earn unlimited certificates to enhance your resume

Starting at $1 USD/mo after a free trial – cancel anytime

In this final audio lesson, we'll tackle text-to-audio generation by converting text to speech. Text-to-speech is a challenging task because it is a one-to-many problem. In classification, you have one correct label, maybe a few. In automatic speech recognition, there's one correct transcription for a given utterance. However, there's an infinite amount of ways to say the same sentence. Each person has a different way of speaking, but they are all valid and correct. Think about different voices, dialects, speaking styles, and so on. Despite these challenges, there are open-source models that can handle this task really well, and you're about to use one of them. We'll use a VITS pre-trained model from Kakao Enterprise. This is one of the two models that can fit in this environment. And this model has a permissive license. Once you have the pipeline, all you need to do is to pass some text to it. Let's write some text. Now let's pass this text to the pipeline. Let's give it a listen. Researchers at the Allen Institute for AI are going to face Microsoft. The University of Washington, Carnegie Mellon University, and the Hebrew University of Jerusalem developed a tool that measures atmospheric carbon emitted by cloud servers while training machine learning models. After a model's size, the biggest variables were the server's location and time of day it was active. And just like that, you can convert text into an aerated audio recording. Feel free to paste your text into your computer. Feel free to paste your own text and play with the pipeline. In the next lesson, Yunus will show you how to build an object detector. Let's go on to the next lesson.

deco top

deco bottom

Open Source Models with Hugging Face

Sign in to continue learning

Open Source Models with Hugging Face

Beginner

2h13m

Topics

Chatbots

Generative Models

MultiModal

NLP

Prompt Engineering

Transformers

Collaborator

Open Source Models with Hugging Face

Introduction
Video
・
5m

Selecting models
Video
・
5m

Natural Language Processing (NLP)
Video with Code Example
・
9m

Translation and Summarization
Video with Code Example
・
5m

Sentence Embeddings
Video with Code Example
・
5m

Zero-Shot Audio Classification
Video with Code Example
・
9m

Automatic Speech Recognition
Video with Code Example
・
15m

Text to Speech
Video with Code Example
・
2m

Object Detection
Video with Code Example
・
11m

Image Segmentation
Video with Code Example
・
16m

Image Retrieval
Video with Code Example
・
7m

Image Captioning
Video with Code Example
・
5m

Multimodal Visual Question Answering
Video with Code Example
・
4m

Zero-Shot Image Classification
Video with Code Example
・
6m

Deployment
Video with Code Example
・
11m

Conclusion
Video
・
1m

Graded・Quiz

Course Details