Toot

Written by John Allsopp on 2024-11-16 at 09:47

@darrell73 I use gpt4 and Gemini 1.5 for something related. I take screen grabs of the slides from presentation videos. I get the models to return structured html with descriptions of images, tables graphs and so on.

They do an excellent job though not perfect.

Though the information density of a page is greater so ymmv.

IME it never hallucinates.

=> More informations about this toot | View the thread | More toots from johnallsopp@indieweb.social

Mentions

=> View darrell73@mastodon.online profile

Tags

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113491995614996555
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
229.074917 milliseconds
Gemini-to-HTML Time
0.322134 milliseconds

This content has been proxied by September (3851b).