From the course: Generative AI Tools for Productivity and Research

Unlock the full course today

Join today to access over 23,200 courses taught by industry experts.

Extract text from an image with Google Gemini

Extract text from an image with Google Gemini

- [Instructor] Gemini is truly multimodal. It can work with different data types. For example, it can extract text data from images, upload image_4 from the provided images. This image is a process flow from one of my projects. Now, let's write a prompt, "extract the text component from this image." Okay, so Gemini was successfully able to extract the text component from these images. The first part is ideation. It has sub topics like Topic Selection, Topic Level, Target Audience, Vocabulary Development, and Scripting. That aligns with exactly what's in the image. And the same for production, you can see Video Recording, Video Editing, and Subtitling. Well, subtitling is originally under production in the original image. Let's see if all the drafts got it right. So this is Draft 2, okay, and also Draft 3. In this particular instance though, Gemini made an attempt to extract as much as possible as it can see. Subtitling was placed under production in the original image. In this…

Contents