a thread in the fabric

24 March 2011

Benchmark pigz vs gzip

What is pigz ?

“pigz, which stands for parallel implementation of gzip, is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data.” – pigz

Who doesn’t love benchmarks ?

File Data: Tarred Enron Corpus
File Size: 1823139840 bytes ~ 1.7 gigabytes

System Specs:
Processor Name: Intel(R) Xeon(R) CPU E5507
Processor Speed: 2.7 GHz
Number Of Processors: 1
Total Number Of Cores: 4
L2 Cache (per core): 256 KB
L3 Cache: 4 MB

pigz results

compress

time pigz -9 enron_mail_082109.tar
real 0m44.136s
user 2m5.400s
sys 0m1.570s

decompress

time pigz -d -9 enron_mail_082109.tar.gz
real 0m14.354s
user 0m12.270s
sys 0m3.260s

gzip results

compress

time gzip -9 enron_mail_082109.tar
real 2m27.748s
user 1m53.860s
sys 0m1.380s

decompress

time gunzip -9 enron_mail_082109.tar.gz
real 0m29.564s
user 0m18.270s
sys 0m2.250s

Here’s another benchmark