I have a tiny little instance that’s being absolutely overwhelmed after I connected it to other communities. I’ve run a script to give me something like 40K posts to toss off to the purge API, but somehow my disk usage is expanding while this purge is going on. My disk usage is being caused by all the media, but I’m sure how to nuke media from outside of the instance efficiently. The API calls are kind of slow. I’d rather just issue a direct command to delete the media from existence, but I haven’t been able to find where the delete tokens for posts are stored to just rapid fire issue the command from within my server (and thus not have to stagger my calls to not be rate limited)
Can someone help me? I feel like there’s something pretty simple I’m overlooking here.
EDIT 1: Running some diagnostics, I learned that 10GB of my disk is media and 10GB is the activity table (Thanks @[email protected] for pointing that out to me)
I am still left wondering how to purge the 10GB of worthless media in a way that doesn’t leave everything corrupted. Of course I can just navigate to where it is on disk and just deleted, but this feels like a bad idea. My attempt to just run purge API calls has been stymied by rate limiting. Congrats to lemmy for that, but really sucks for me who needs to delete a lot of files.
Media isn’t federated. The media should just be referenced with a link to the original source.
Normally, the largest use of disk space is the Activity table. It is stored for six months, and only useful for debugging. Below is the Issue, along with SQL commands to check and purge this debugging table. Let us know if this was the issue
https://github.com/LemmyNet/lemmy/issues/3103
Media absolutely gets federated. My pictrs folder is 10GB. Another 10GB is the activity table, so I tip my hat to you for finding that. I still have a very significant amount of worthless data on my disk though