• Vlyn@lemmy.zip
    English
    arrow-up
    5
    ·
    1 day ago

    You have 170+ GB VRAM at home? (:

    I mainly use DeepSeek v4 Flash now, it’s the cheapest around and the quality is high enough for coding. At work we’re throwing tons of money at Claude, but even there I usually stick to Sonnet (as Opus is burning money).

    • De Lancre@lemmy.world
      English
      arrow-up
      2
      arrow-down
      1
      ·
      edit-2
      14 hours ago

      You don’t need 170+ GB of VRAM. The whole model can be run at around 1 token/second on modern hardware from an SSD. Which is slow, don’t get me wrong, but it is still somewhat usable.

      Upd. Once again, for those who use AI because they struggle to read: it is slow, but it is usable. Which, by definition, means that you don’t need 170+ GB of VRAM to run this model. Period. It runs from an SSD. That is a fact.

      • placebo@lemmy.zip
        English
        arrow-up
        2
        arrow-down
        1
        ·
        18 hours ago

        “Somewhat” is doing a lot of heavy lifting there 😂 How much time does it take to process your average request?

      • dil@lemmy.zip
        English
        arrow-up
        1
        arrow-down
        1
        ·
        14 hours ago

        1 token/second isn’t remotely acceptable lol