{"id":17,"date":"2007-07-18T11:59:43","date_gmt":"2007-07-18T09:59:43","guid":{"rendered":"http:\/\/lisas.de\/~alex\/?p=17"},"modified":"2007-07-18T21:16:48","modified_gmt":"2007-07-18T19:16:48","slug":"beyond-bzip2","status":"publish","type":"post","link":"https:\/\/lisas.de\/~alex\/?p=17","title":{"rendered":"Beyond bzip2"},"content":{"rendered":"<p>Yesterday I wanted to help push large chunks of log data (30G) through a ridiculously low-bandwidth channel. I remembered reading about Con Kolivas&#8217; lrzip and wondered whether it would provide better compression than bzip2 in this case. So I ran a little benchmark on a 1G chunk of data and compressed it with <a href=\"http:\/\/www.bzip.org\/\">bzip2<\/a>, <a href=\"http:\/\/ck.kolivas.org\/apps\/lrzip\/\">lrzip<\/a> and <a href=\"http:\/\/p7zip.sourceforge.net\/\">p7zip<\/a>. <\/p>\n<table align=\"center\" border=\"1\">\n<thead>\n<tr>\n<td width=\"20%\"><\/td>\n<td width=\"20%\"  align=\"center\"><em>ratio<\/em><\/td>\n<td width=\"20%\"  align=\"center\"><em>time<\/em><\/td>\n<td width=\"20%\"  align=\"center\"><em>ratio vs. bzip2<\/em><\/td>\n<td width=\"20%\"  align=\"center\"><em>time vs. bzip2<\/em><\/td>\n<\/tr>\n<\/thead>\n<tr>\n<td><strong>bzip2<\/strong><\/td>\n<td align=\"center\">7.14<\/td>\n<td align=\"center\">05:45<\/td>\n<td align=\"center\">100.00%<\/td>\n<td align=\"center\">100%<\/td>\n<\/tr>\n<tr>\n<td><strong>lrzip<\/strong><\/td>\n<td align=\"center\">7.26<\/td>\n<td align=\"center\">24:33<\/td>\n<td align=\"center\">101.76%<\/td>\n<td align=\"center\">427%<\/td>\n<\/tr>\n<tr>\n<td><strong>p7zip<\/strong><\/td>\n<td align=\"center\">8.42<\/td>\n<td align=\"center\">28:13<\/td>\n<td align=\"center\">118.01%<\/td>\n<td align=\"center\">490%<\/td>\n<\/tr>\n<\/table>\n<p>OK, so I&#8217;ll stick with bzip2 for now. At least for that kind of data, lrzip is not really an option &#8211; ~330% extra effort to get less than 2% improvement just doesn&#8217;t seem worth it. 
I have to say I am impressed by the compression p7zip (or rather 7-Zip) can achieve, but it is very expensive nevertheless. I like that it seems to use all available CPUs automatically, though (in wall-clock time it ran for only ~19 minutes on a hyperthreading machine).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Yesterday I wanted to help push large chunks of log data (30G) through a ridiculously low-bandwidth channel. I remembered reading about Con Kolivas&#8217; lrzip and wondered whether it would provide better compression than bzip2 in this case. So I ran a little benchmark on a 1G chunk of data and compressed it with bzip2, lrzip [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8],"tags":[],"class_list":["post-17","post","type-post","status-publish","format-standard","hentry","category-compression"],"_links":{"self":[{"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=\/wp\/v2\/posts\/17","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=17"}],"version-history":[{"count":0,"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=\/wp\/v2\/posts\/17\/revisions"}],"wp:attachment":[{"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=17"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=17"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lisas.de\/~alex\/index.php?rest_route=%2Fwp%2Fv2%
2Ftags&post=17"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}