Terminal [SMBC]

cannedtuna@lemmy.world · 3 days ago

Terminal [SMBC]

fonix232@fedia.io · 3 days ago

On the topic of AI… I’ve found that smaller, self hosted models are actually quite adept at helping you search better. I’m not talking about replacing a search engine with an AI interface (like e.g. Perplexity did), but when you struggle to find relevant hits, throwing the query into a mini LLM (3-4B params), one you can even run on your phone, can be super helpful in refining the query itself, including further keywords you might not have thought of. Again, not talking about using a cloud hosted big model that burns through ten times more kwh to give a worse result.

Kairos@lemmy.today · 2 days ago

Oh I’ve never thought of that. Thanks.

Doublenut@lemmy.zip · 3 days ago

Do you have any examples of such models? Models like this are the only interest I have in LLMs.

fonix232@fedia.io · 3 days ago

I’ve been mainly using gemma4:e4b - interestingly, even though it’s supposed be a subset/lesser variant of Gemini for self-hosting, its overall approach in reasoning and iterations over search terminology and query optimisation is much, much better than what you get from Gemini.

And to be fair I also quite like Perplexity - its ability to find relevant information from multiple sources, find both supporting and counter arguments where relevant, and build quite precise summaries. I do wish it was DIY-able, but with all the malicious actors I don’t feel comfortable allowing any LLM to just browse the net, who knows, even with properly secured tool calling, prompt injection can happen.