
Question is why even use these small models?

When you have Google Flash, which is lightning fast and cheap.

My brother implemented it in option-k: https://github.com/zerocorebeta/Option-K

It's near-instant. So why waste time on small models? They're going to cost more than Google Flash.
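
For reference, a rough sketch of what that approach might look like with the google-generativeai Python SDK (the GEMINI_API_KEY environment variable and the prompt wording are my assumptions, not anything from option-k):

    # Sketch: convert HTML to Markdown by calling Gemini Flash.
    import os
    import google.generativeai as genai

    genai.configure(api_key=os.environ["GEMINI_API_KEY"])  # assumed env var
    model = genai.GenerativeModel("gemini-1.5-flash")

    html = "<article><h1>Hello</h1><p>Some <b>content</b>.</p></article>"
    resp = model.generate_content(
        "Convert this HTML to clean Markdown. Output only the Markdown.\n\n" + html
    )
    print(resp.text)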



Sometimes you don’t want to share all your data with the largest corporations on the planet.


What is Google Flash? Do you mean Gemini Flash? If so, the article points out that general-purpose LLMs are worse than this specialized LLM at Markdown conversion.


In this case it is not, though. As much as I'd like a self-hostable, cheap, and lean model for this specific task, what we have instead is a completely inflexible model that I can't just prompt-tweak to behave better even in not-so-special cases like the one above.

I'm sure there are good examples of specialised LLMs that do work well (like ones trained on specific sciences), but here the model doesn't have enough language comprehension to understand plain English instructions. How do I tweak it without fine-tuning? With a traditional approach to scraping this is trivial, but here it's infeasible for the end user.


Small models often do a much better job when you have a well-defined task.
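
For example, here's a minimal sketch of running a small task-specific model locally with Hugging Face transformers; the model id is a placeholder, not the actual model from the article:

    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Placeholder id -- substitute whatever small HTML-to-Markdown model you use.
    model_id = "your-org/small-html-to-markdown"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    html = "<article><h1>Hello</h1><p>Some <b>content</b>.</p></article>"
    inputs = tokenizer(html, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=512)

    # Decode only the newly generated tokens, i.e. the Markdown output.
    markdown = tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    print(markdown)

Once the weights are downloaded, nothing leaves your machine, which is the whole point for the privacy and connectivity arguments below.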


Privacy, Cost, Latency, Connectivity.



