A follow-up question about the final score was answered correctly, but Gemini got the name of the scorer of the first touchdown wrong: The AI said it was Johan Dotson. Dotson was shown getting a touchdown in the highlights with the score at 0-0, but it was ruled out, an example of the nuances that AI doesn't necessarily pick up on.
Gemini did successfully identify when the Kansas City Chiefs got their first points, and even included a timestamp linking directly to the touchdown in the YouTube clip. It also got the name of the scorer right. It seems Gemini is heavily reliant on the commentary for sports clips, which isn't surprising.
Summarize Video Contents
Next, we tried putting Gemini up against a behind-the-scenes featurette for The Grand Budapest Hotel, directed by Wes Anderson. The clip runs to four-and-a-half minutes, and Gemini fired back some replies almost instantly: It identified the name of the film being discussed, and the main beats of the clip's narrative.
However, it's all reliant on the audio (or the transcript) again: there doesn't seem to be any analysis of the actual video contents. The AI couldn't say who the talking heads were in the video, even though their names were shown on screen, and wasn't able to say who the director was (though this was also mentioned in the video description).
On the plus side, Gemini did do an impressive job of summing up the audio of the video. It correctly identified some of the filmmaking challenges that were mentioned throughout, and provided timestamps for them, from looking for a set to represent the Grand Budapest, to filling it with extras.
Summarize Interviews
Finally, we tried Google Gemini with an interview: Channel 4 in the UK speaking to Charlie Brooker and Siena Kelly about the latest series of Black Mirror (perhaps appropriate for an article on AI). Gemini proved itself very capable at picking out the talking points, and adding timestamps, though of course the whole video is mostly talking.
Again though, there's no context about anything outside of the audio or the transcript. Gemini AI couldn't say where the interview took place, or how the participants were acting, or anything else about the visuals of the video, which is worth bearing in mind if you use it yourself.
For videos where the answers you want are in the audio of a YouTube video, and its associated transcript, Gemini works very well at summarizing and providing accurate answers (provided the commentators mention when a touchdown is ruled out, as well as when one is scored). For any kind of visual information, you're still going to have to watch the video yourself.