Communick News
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
No-Link-2778@alien.topB to LocalLLaMA@poweruser.forumEnglish · 2 years ago

Deepseek llm 67b Chat & Base

message-square
message-square
22
link
fedilink
1
message-square

Deepseek llm 67b Chat & Base

No-Link-2778@alien.topB to LocalLLaMA@poweruser.forumEnglish · 2 years ago
message-square
22
link
fedilink

https://huggingface.co/deepseek-ai/deepseek-llm-67b-chat

https://huggingface.co/deepseek-ai/deepseek-llm-67b-base

Knowledge cutoff May 2023, not bad.

Online demo: https://chat.deepseek.com/ (Google oauth login)

another Chinese model, demo is censored by keywords, not that censored on local.

alert-triangle
You must log in or register to comment.
  • Independent_Key1940@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    I asked it to create a simple chat interface to talk with open ai’s gpt 3.5 api and to use stream = true option. On the first try, it didn’t know how to handle the stream, so it simply used res.json(). After that, I told it that we needed to take care of streaming text in a special way. It understood this and wrote the correct code. Overall, I’m quite impressed. Way to go deepseak coder!

  • a_beautiful_rhind@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Does it give refusals on base? 67B sounds like full foundation train.

  • farkinga@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    GGUF via TheBloke:

    https://huggingface.co/TheBloke/deepseek-llm-67b-chat-GGUF/

  • AntoItaly@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Wow, this model seems very good for the Italian language!

  • eachcitizen100@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    I wish there was a 13b model which can just fit in on my GPU with quant

  • uti24@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Seems I am doing something wrong with this one.

    I got abismal results with 4_K_M: it had silly grammatical errors and typos, it also did not stick to prompt, so I don’t know.

    • LocoLanguageModel@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      I don’t know if this helps but I’m using the GGUF version of that and it’s working perfectly

  • OrdinaryAdditional91@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Just find that this is released by high-flyer quant, one of the largest private equity firm in China.

  • pseudonerv@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    The chat model is the first that knows how to compare the weight of bricks and feathers.

    The weight of an object is determined by its mass and the gravitational force acting on it. In this case, both objects are being compared under the same gravitational conditions (assuming they’re both on Earth), so we can compare their masses directly to determine which weighs more.

    1kg of bricks has a mass of 1 kilogram. 2kg of feathers has a mass of 2 kilograms.

    Since 2 is greater than 1, the 2kg of feathers weigh more than the 1kg of bricks.

  • Beb_Nan0vor@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    That coding is pretty damn good based off of limited tests. I’ll have to experiment more.

  • quantomworks@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    I made it write about itself using LocalAI https://sfxworks.net/posts/deepseek/

    I will post a how-to on using local-ai on my free time if anyone is interested

  • oobabooga4@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    I’m desensitized at this point. I wonder if this is yet another Pretraining on the Test Set Is All You Need marketing stunt or not, as most new models lately have been.

    • Neologismus@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      I threw my reasoning test questions at the web version and it performed worse than most 70B i tried. About the level of Yi.

  • nested_dreams@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Brother u/The-Bloke , can we get a quant of the uncensored base model too? ♥╣[-_-]╠♥

  • llama_in_sunglasses@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    LoneStriker has a 2.4 bpw quant up: https://huggingface.co/LoneStriker/deepseek-llm-67b-chat-2.4bpw-h6-exl2

  • Lance_lake@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    not that censored on local.

    So… Some censoring?

  • ab2377@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    depepseek is one of my fav, i use it everyday for code generation. its got an extra option for the chat now at the link you shared, just general chat about anything, pretty good at it

LocalLLaMA@poweruser.forum

localllama@poweruser.forum

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@poweruser.forum

Community to discuss about Llama, the family of large language models created by Meta AI.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4 users / day
  • 1 user / week
  • 4 users / month
  • 4 users / 6 months
  • 1 local subscriber
  • 4 subscribers
  • 1.02K Posts
  • 5.81K Comments
  • Modlog
  • mods:
  • communick@poweruser.forum
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org