Looking to maybe self host my own instance, I’m still learning about the fediverse. If a different instance that I federate with hosts something illegal are there risks to me? Is anything from other instances hosted on my server like a copy of it? Or would I only end up hosting things my users post? I’m paranoid and sorry if this is a silly question.

  • pe1uca@lemmy.pe1uca.dev
    link
    fedilink
    English
    arrow-up
    7
    ·
    1 year ago

    I’m running it in the smallest VPS of vultr with 25GB of disk.
    This instance only has 3 users, with me being the only active. It says it’s been up for almost a month and I’ve only used 3GB.

    Here are the docker volumes which have the actual data of your instance, and from inside the DB the biggest table is the one called activity which the devs said it’s only sometimes used to validate the data, but could be truncated if needed (there’s a schedule task which only keeps up to 6 months).
    Also the thing to have in mind is to properly configure the logs of whichever installation guide you follow.
    After that I’ve seen other admins say the next biggest is the media uploaded (from bigger instances).

    $ du -h --max-depth=1
    640K    ./pictrs
    3.2G    ./postgres
    3.2G    .
    
    lemmy=# select
      table_name,
      pg_size_pretty(pg_relation_size(quote_ident(table_name))),
      pg_relation_size(quote_ident(table_name))
    from information_schema.tables
    where table_schema = 'public'
    order by 3 desc;
             table_name         | pg_size_pretty | pg_relation_size
    ----------------------------+----------------+------------------
     activity                   | 2187 MB        |       2292867072
     comment                    | 56 MB          |         58212352
     person                     | 48 MB          |         50307072
     comment_like               | 45 MB          |         47161344
     post_like                  | 22 MB          |         22781952
     comment_aggregates         | 14 MB          |         14811136
     post                       | 13 MB          |         13623296
    
    • gabe565@lemmy.cook.gg
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 year ago

      The activity table is also used to deduplicate incoming federation data, so instead of truncating it, I’d suggest deleting rows after a certain amount of time.

      For my personal instance, I set up a cron to delete entries older than 3 days, and my db is only ~500MB with a few weeks of content! I also haven’t seen any duplicated posts or comments. Even with Lemmy’s retries, 3 days seems to be long enough before dropping rows from that table.

    • 𝙚𝙧𝙧𝙚@feddit.win
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      1 year ago

      How are you keeping your pictrs directory so small?

      Mine is at about 5GB after two weeks with just a single user. 😬