Using OpenAI APIs: Using Image & Audio APIs
Explore OpenAI’s DALL-E and Whisper models. Generate stunning images with text prompts, transcribe and translate multilingual audio with high accuracy.
About this course
DALL-E and Whisper are OpenAI’s image and audio-based model offerings. DALL-E, an image generation model, demonstrates the ability to create visually striking images based on textual prompts. Whisper represents a state-of-the-art automatic speech recognition (ASR) system. With its high accuracy in transcribing spoken words, Whisper finds utility in various applications, from voice assistants to transcription services. You will begin this course by generating images using OpenAI’s DALL-E model. You will generate images using text prompts, create variations of existing images, and perform image inpainting using natural language. Then, you will work with the Whisper model, which caters to speech transcription and translation. You will transcribe and translate audio in different languages and accents, and you will evaluate the performance of these models.
Learning objectives
Discover the key concepts covered in this course
Generate images using dall-e
Create image variations and perform inpainting
Show all
There are no reviews yet.