Communick News
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Little-Name9809@alien.topB to LocalLLaMA@poweruser.forumEnglish · 2 years ago

Safety checks in Llama 2

message-square
message-square
4
link
fedilink
1
message-square

Safety checks in Llama 2

Little-Name9809@alien.topB to LocalLLaMA@poweruser.forumEnglish · 2 years ago
message-square
4
link
fedilink

Recently came across this AI Safety test report from LinkedIn: https://airtable.com/app8zluNDCNogk4Ld/shrYRW3r0gL4DgMuW/tblpLubmd8cFsbmp5

From this report it seems Llama 2 (7B version?) lacks some safety checks compared to OpenAI models. Same with Mistral. Did anyone find the same result? Has it been a concern for you?

  • CookieCat171@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    afety checks in Llama 2

    it seems it’s comparing chat models: https://airtable.com/app8zluNDCNogk4Ld/shrYRW3r0gL4DgMuW/tblpLubmd8cFsbmp5

    • phree_radical@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Looks like you’ve now made some changes. Columns now read “Llama2-7b-chat” instead of “llama2.” Also, chat responses below the completions, chastising the inappropriate messages. However, a completion was generated, first, and the item is still marked as “fail.” Very poor show

LocalLLaMA@poweruser.forum

localllama@poweruser.forum

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@poweruser.forum

Community to discuss about Llama, the family of large language models created by Meta AI.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4 users / day
  • 1 user / week
  • 4 users / month
  • 4 users / 6 months
  • 1 local subscriber
  • 4 subscribers
  • 1.02K Posts
  • 5.81K Comments
  • Modlog
  • mods:
  • communick@poweruser.forum
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org