Celery seems to be implicated in more than its fair share of issues. I've had good results with RQ in the past. Maybe it's a simpler and more robust option?
After fixing media being marked as failed on Celery restart, and the missing Celery logging, things aren't looking too bad. Maybe this is not a problem unless we hit another major Celery issue.