— AI NEWS —

Cheap AI “video scraping” can now extract data from any screen recording

Oct 18, 2024

Quick summary

Feeding screen recordings or other video into a video-enabled model such as Google’s Gemini allows you to extract data from the video.

Why it matters

In one experiment, a researcher needed to add up numeric values scattered across twelve different emails. He recorded his screen while scrolling through the emails, then had Google Gemini extract the numbers from the recording into a CSV file for use in a spreadsheet.
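
The final step of that workflow, adding up the values once the model has returned CSV, can be sketched in Python. This is a minimal illustration, not the researcher's actual code: the function name `sum_column`, the column names, and the sample figures are all invented for the example, and the call to the model itself is left out.

```python
import csv
import io
from decimal import Decimal


def sum_column(csv_text: str, column: str) -> Decimal:
    """Add up one numeric column from CSV text such as a model might return."""
    reader = csv.DictReader(io.StringIO(csv_text.strip()))
    total = Decimal("0")
    for row in reader:
        # Strip thousands separators and currency signs; skip empty cells.
        raw = (row.get(column) or "").replace(",", "").replace("$", "").strip()
        if raw:
            total += Decimal(raw)
    return total


# Hypothetical model output: one row per email, with made-up amounts.
sample = """email,amount
1,120.50
2,"1,030.00"
3,49.50
"""

print(sum_column(sample, "amount"))  # prints 1200.00
```

Using `Decimal` rather than `float` avoids rounding surprises when the values are money. It is still worth spot-checking the extracted rows against the original recording, since the model can misread a figure.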

While this is a simple example, the implications of the ability to video-scrape screencasts are significant. It means anything you can display on your screen (websites, apps, e-learning, etc.), and anything that can be captured as video from a phone or camera (books on a bookshelf, panoramic displays), has the potential to become usable input for AI.

Several major model vendors, including OpenAI and Anthropic, have research previews that demonstrate video input, but so far only Google has released the feature, in Gemini. The likely reason is the high computation cost of processing video. Those costs will inevitably fall, so expect video input to become widely available in the near future.

More AI News

Sep 25, 2025

Perplexity challenges Google with new search API

Google dominates the search space, but does not allow other AI model vendors to use its search results. Perplexity’s new search API could help close the search capability gap between Gemini and its rivals by providing better search results than current Google search alternatives.

Sep 11, 2025

Anthropic’s Claude introduces memory feature

Anthropic has launched a memory feature for Claude that allows it to remember team projects, preferences, and work patterns across conversations. Available for Team and Enterprise plans, the feature creates separate memories for different projects and includes an Incognito mode for sensitive discussions.

Jul 11, 2025

Articulate 360 Rise adds AI audio features

Rise has new AI audio features, including AI voice narration from a script, auto-generated audio transcripts, and the ability to add sound to 22 additional block types.
