Just thinking about making this a monthly post, which model are you using? what are the positives and negatives?

  • @[email protected]
    link
    fedilink
    English
    89 days ago

    I mainly use Llama-3-8B abliterated for everyday questions, and DeepSeek-Coder-V2-Lite for programming/Linux stuff.

    • iktOP
      link
      fedilink
      English
      26 days ago

      Using DeepSeek-Coder-V2-Lite now, it’s awesome!

  • @[email protected]
    link
    fedilink
    English
    69 days ago

    I find that for the purpose of my projects (narrative building, tabletop rpg simulation) gemma3:14b (with low temperature) works perfectly to create consistent psychological overviews.

  • @[email protected]
    link
    fedilink
    English
    59 days ago

    QWQ-32B for most questions, llama-3.1-8B for agents. I’m looking for new models to replace them though, especially the agent one.

    Want to test the new GLM models, but I’d rather wait for llama.cpp to definitely fix the bugs with them first.

  • SmokeyDopeM
    link
    fedilink
    English
    5
    edit-2
    9 days ago

    I have been using deephermes daily. I think CoT reasoning is so awesome and such a game changer! It really helps the model give better answers especially for hard logical problems. But I don’t want it all the time especially on an already slow model. Being able to turn it on and off wirhout switching models is awesome. Mistral 24b deephermes is relatively uncensored, powerful and not painfully slow on my hardware. a high quant of llama 3.1 8b deephermes is able to fit entirely on my 8gb vram.

    • @[email protected]
      link
      fedilink
      English
      39 days ago

      Newbie here. I’m not sure if the documentation tells me if it can run with ollama. If I understand correctly you have to build it «by hand»? I mainly use ollama/models on the official website and I’m too scared to plunge deeper into the mechanics haha.

      • @[email protected]
        link
        fedilink
        English
        4
        edit-2
        9 days ago

        Not for GGUF converted models.

        Just run the following command in ollama

        ollama run hf.co/wanlige/li-14b-v0.4-Q4_K_M-GGUF
        

        I also recommend you set up Open WebUI as your front-end instead of staying in a terminal.

        Also, look into “abliterated” uncensored models. There’s plenty of them on hf that are already converted for you

  • @[email protected]
    link
    fedilink
    English
    49 days ago

    Fallen Gemma. The writing style is really good and it can keep relatively persistent personalities. On the other hand it’s stupid af compared to other recent models and even the vanilla Gemma 3.