Comment

Torsten Will

Yes, exactly my thought. slimmer.py wont gain much if compression is applied anyway.

But what about translation to uppercase and removing all vowels... ;-)

Parent comment

Fredrik

Since the slimmer filter doesn't really increase the document entropy, you still get roughly the same percentages for zip(slimmer(text)) vs. zip(text) as you get for slimmer(text) vs. text. (but all zip() variants are of course a lot smaller than the others, so you save a lot less bytes. in this case, it's just over 4k, if my calculations are correct).