Mal kurz das Tretlager neu schmieren.

OhneHose@feddit.org · 57 minutes ago

Ja, hätte er wirklich nicht tun müssen.

Ist ja quasi Fakt 👀

OhneHose@feddit.org · 1 hour ago

Ok Mr grumpy

OhneHose@feddit.org · 1 hour ago

I think if you open up a study, you should and probably need to be specific with the terms. Since llms are just large machine learning models. Just not trained for a single specific use case. You can also achieve very impressive results with small models, you don’t need chatgpt 5 for document classification. You can also fine-tune these models for specific tasks and/or “lobotomize” them. But f.e. go with a small qwen model with just 36B parameters or less and you will get very good results. And sure there are the good old OCR methods but you’ll need a significant pipeline behind a classic ocr machine. And it would probably still fail to decipher/classify a machine written document with hand written annotations. When you use a decent LLM, it will in most cases be able to differentiate between handwriting & machine letters, it will be able to output both in different variables and it might even be able to put the annotations in context to the original document. And this is an enormous task to program by hand.

And when we talk about speed and sustainability, not every document would be thrown at the expensive model first. But you would build a layered approach, so that 95% of the easy documents would be handled by a cheap and fast solution, but when that has a low confidence, then you would hand the document over to the bigger slower model.

Then add graphs or tables to the document and you’ll be nearly completely lost with a classic approach.

I’ve been working in this field for a couple years, so I speak from personal experience.

But still all those models still have an issue with context sizes and you and your business pipeline will fail if you don’t know the boundaries of what’s possible today. For the most high profile cases there should always be a human in the loop. Do companies do that? Most likely not, but they can get in big trouble if they make a critical mistake, at least in Europe, can’t speak for the wild West/US.

Note: You can self host qwen3.6 with 32gb or better 64gb and play it. It is shockingly good.

Data gathering and theft of IP is a completely different topic. But “luckily” many people now upload their data for free, directly to one of the big hosting companies. But privacy is also a different topic.

So again, be very specific if you choose your topic.

OhneHose@feddit.org · 4 hours ago

Yeah and if you’d use Linux your work laptop would be sitting at like 6gb, if you’d just browse the web and edit text documents.

The only time I ever need more than 10 gigs on my main machine is when I play games. Windows and Linux aren’t even close in normal use cases.

You can even try it yourself, grab yourself a 10 year old laptop and install any Linux distro on it, it will feel like a new machine.

OhneHose@feddit.org · 4 hours ago

Yeah, it’s really important to specify, what cases because they aren’t the same at all.

There’s even some really good uses cases for llms in companies. With a declining demographic most European countries face, a goal for company could f.e. be being more efficient, since (f.e. the company I worked for) 30-40% of staff will be in retirement age in the next 10 years. And if you work with documents (what we did) there’s a real benefit of llms classifying and compressing these documents. We speak of 10s of thousands a day. And the now used systems lack in flexibility to reliably classify or even read some of those documents. On top of that, you don’t need a 200B+ model for those tasks.

But that’s the good side in my eyes.

There’s loads more of problematic and socio economic issues with those models. Especially revolving around how people learn, decide and interact with each other.

You’re diving into a really broad field here and you’ll have to pick out very specific cases. It is for sure a super interesting field.

And on top of that, it’s a really old computer science field, dating back to the 60ies. It just now comes to “fruition” since our tech advanced so much that we can actually process these stupid amounts of data.

Before open ai & others popped up this all was labeled under computer linguistics & Data science, which just doesn’t sound as sexy I guess.

OhneHose@feddit.org · edit-2 6 hours ago

I mean, ai isn’t inherently bad. It’s more of an issue how the big companies do push it.

Ai in research is phenomenal, especially in medicine applications. It’s not a black & white issue.

And we are just at the beginning of it, best of luck! U’d also need to differntiate between LLM(general use cases) and task/research specific ai.

OhneHose@feddit.org · 6 hours ago

I’d not use ollama, it’s basically just a fancy wrapper around lama.cpp.

There’s also modules/docker containers to hot swap models with lama.cpp

My model hosting setup is: Lama.cpp -> Open web UI

Lama.cpp is running in a local shell on my Mac Mini, since setting up GPU support with metal is (or was?) a pain. And open web UI sits in a docker with a local storage mounted so it have persistence when updating or moving the docker.

16gigs vram however ain’t too much, you’ll be fairly limited to fairly low quants. It will be reasonably fast tho. If you can use most of your system ram you could go and host f.e. qwen 3.6 bf8(~56gb) or bf4 (~30gb). It would be slower but you also gain a lot of usability from that.

Or you host two models a smaller one on the GPU and bigger one with system ram so you can switch between “knowledge” and speed.

Using lama.cpp you’ll have to take a look at huggingface & use gguf models.

OhneHose@feddit.org · 8 hours ago

Linux? Depending on the distro, ram usage is a joke. Got a rpi and a NUC running Debian as a server (so without running the display environment constantly) and the use ~ 500-800mb ram. With a DE, they need like 1-1.2gigs.

The nuc is particularly funny since it hosts 2 Nextcloud instances, 2 websites and a qwen3 embedding model and sits at ~ 6gb ram.

OhneHose@feddit.org · 1 day ago

This is the answer op should be looking for.

There’s nothing wrong with Ubuntu/Kubuntu. KDE with Ubuntu (Kubuntu) solved a lot of multiscreen issues I had with gnome. The customisation options with KDE are basically limitless.

OhneHose@feddit.org · 1 day ago

“What’s going on? I don’t know”

Well that’s some lack of reflection that is actually impressive.

OhneHose@feddit.org · 1 day ago

It’s a kink party

OhneHose@feddit.org · 2 days ago

They are influencing elections since 2015/2016. There’s really no surprise here.

Cambridge analytica even made a talk in how they influenced the first trump election.

OhneHose@feddit.org · 3 days ago

Because then you get no advertising moneyzzzz

OhneHose@feddit.org · 3 days ago

Yes, and they need to be gone or heavily restricted before things get better.

OhneHose@feddit.org · 5 days ago

Mama told me, I’m big!

OhneHose@feddit.org · 5 days ago

Und wer dauerhaft hupt ist unglaubwürdig :D

OhneHose@feddit.org · 5 days ago

Wer als erstes hupt!

OhneHose@feddit.org · 5 days ago

Wie sehen die Spuren denn aus? Kratzer? Risse? Verfärbt? Wenn du mit dem Nagel drüber gehst und nicht hängen bleibst, ist alles jut. Zu Not mit feinem Schleifpapier vorsichtig nachhelfen.

OhneHose@feddit.org · 5 days ago

But there are cars that do it. Just because you never saw it, doesn’t mean it doesn’t exist.

OhneHose@feddit.org · 7 days ago

Peter Thiel hat vor über einem Jahrzehnt behauptet er müsse den Antichrist aufhalten. Ich wünschte es wäre ein Scherz, ist aber leider war. Dlf hat 1nen sehr guten Podvast über P Thiel, der erklärt (leider) einiges.

Die komplette milliardärsbubble hat einen Dachschaden, schade daß sie so mächtig sind und die meisten Menschen das einfach so hinnehmen.

OhneHose@feddit.org · 9 days ago

Mal kurz das Tretlager neu schmieren.

OhneHose

Mal kurz das Tretlager neu schmieren.

Mal kurz das Tretlager neu schmieren.