The Art
of Lossless
Data Compression
vol. 24t
Here are the results of tests performed in August 2002 to compare
lossless compression of "plain" texts by all known sufficiently good programs
developed for this purpose, including RK, DC, YBS, Bzip2, RAR, ACE and 7-zip.
See Archive Comparison Test by J.Gilchrist for more details:
http://compression.ca
If anybody wants to start or continue such tests,
or can suggest other sets of texts or other compression programs
(executable programs only, not sources or algorithm descriptions),
or knows that we have missed something important
(some fantastic new technology, an algorithm, or even a program capable
of lossless compression of up to 1000:1, etc.),
please let us know immediately: artest@inbox.ru Thank you!
[[1]] COMPRESSION QUALITY
=========================
(see also
[[2]] Speed
[[3]] Details
[[4]] Comments)
The seventh (last) line of each table shows the results for all 1231 texts in the six sets combined.
Original  Entropy PPMonstr    PPMN    Slim Compressia    PPMd      RK    PPMy     BEE
 553.31%     100%   101.76  105.51  104.60     105.43  107.64  105.64  108.51  110.23
 543.05%     100%   108.50  110.03  111.28     110.13  113.63  112.26  115.04  113.97
 435.56%     100%   107.96  107.05  109.84     108.56  111.94  110.78  110.84  112.84
 492.76%     100%   110.61  110.13  113.47     112.24  115.48  114.58  115.34  116.03
 799.24%   101.80     100%  115.93  115.32     117.76  109.06  110.51  126.27  114.11
 353.09%     100%   102.50  101.06  104.12     108.60  105.61  109.98  105.80  106.40
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 468.44%     100%   107.25  107.48  110.27     110.72  111.76  112.23  112.56  112.71

     RAR      DC     SBC     BOA     YBS   UHARC     ERI   7-zip   BZip2   Pkzip
  109.41  108.66  108.98  110.03  108.95  107.33  109.53  116.80  124.06  159.23
  114.56  114.28  115.55  117.47  117.38  115.17  119.44  120.55  130.92  178.03
  112.57  111.19  112.57  113.19  113.95  112.43  115.21  115.85  124.48  163.20
  116.21  115.77  116.82  117.62  118.62  117.78  120.29  121.73  131.87  174.67
  121.49  121.44  118.06  122.31  120.56  122.76  125.99  138.39  149.67  205.34
  106.15  111.66  110.30  107.42  109.44  113.54  111.44  106.72  116.85  148.43
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  113.08  114.02  114.27  114.34  115.32  115.72  117.20  117.69  127.59  167.57
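
The percentages above can be turned into more familiar figures. The sketch below assumes (this is an interpretation, not something the text states) that every value is a compressed size relative to the best result for that set, and that "Original" is the uncompressed size on the same scale. The function names are illustrative only.

```python
# A minimal sketch, assuming each table value is a size expressed as a
# percentage of the best compressed size for the same set of texts.

def percent_of_original(program_pct, original_pct):
    """Compressed size as a percentage of the uncompressed size."""
    return 100.0 * program_pct / original_pct


def bits_per_byte(program_pct, original_pct):
    """Average output bits per input byte (8 bits in an uncompressed byte)."""
    return 8.0 * program_pct / original_pct


# Values copied from the combined (seventh) line of the tables.
combined = {"Entropy": 100.0, "PPMonstr": 107.25, "Pkzip": 167.57}
original = 468.44

for name, pct in combined.items():
    print(f"{name:10s} {percent_of_original(pct, original):6.2f}% of original, "
          f"{bits_per_byte(pct, original):.3f} bits per byte")
```

Under that assumption the best program on the combined line leaves a little over a fifth of the original size.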
[[2]] Speed
===========
The Canterbury Corpus Large Set (http://corpus.canterbury.ac.nz/resources/large.zip)
was used for this test, on a 970 MHz PC with 256 MB RAM running Windows 98.
Columns, left to right: program and options; compression time, seconds;
extraction time, seconds; Overall score, seconds and %; Average Users' score,
seconds and %; compressed size, bytes and %.
no compression 0 0 4446 559 4446 577 16005619 600
777 a -mg 140 84 1209 152 1070 139 3544038 133
777 a -mg -mu32 157 102 1234 155 1079 140 3513162 132
7za a -t7z 82 2 1110 140 1028 134 3692764 138
7za a -t7z -mx 127 2 1089 137 963 125 3454527 129
7za a -tzip 24 1 1245 157 1222 159 4393637 165
7za a -tzip -mx 45 1 1269 160 1224 159 4401174 165
acb B 395 396 1722 217 1331 173 3352388 126
acb b 514 516 1939 244 1430 186 3272388 123
acb u 710 711 2317 291 1614 210 3225662 121
ace32 a -d4096 66 2 1124 141 1058 137 3801917 142
ace32 a -d4096 -m1 31 2 1134 143 1104 143 3965841 149
ace32 a -d4096 -m5 206 2 1249 157 1045 136 3746553 140
arh a 38 40 1091 137 1053 137 3647067 137
arh a -2 -1 68 40 1121 141 1054 137 3647067 137
ba -k -50 35 12 964 121 929 121 3298943 124
bee a -m2 123 125 1134 143 1012 131 3188565 120
bee a -m3 162 164 1203 151 1043 135 3156801 118
bee a -m3 -s 204 206 1294 163 1092 142 3185012 119
bix a -mdg -s 92 1 1069 134 978 127 3514944 132
boa -m1 86 88 1253 158 1168 152 3886863 146
boa -m15 139 141 1165 146 1027 133 3182739 119
boa -m15 -s 138 140 1148 144 1011 131 3132810 117
bzip2 -k 21 6 1032 130 1011 131 3616113 136
bzip2 -k -9 20 6 1031 130 1011 131 3616113 136
ery-e.bat t o12 94 95 1003 126 910 118 2932445 110
ery-e.bat t o16 98 99 1001 126 904 117 2892711 108
ery-e.bat t o32 105 106 1009 127 905 118 2873677 108
ery-e.bat t o64 112 111 1022 128 911 118 2873318 108
compcl c -b15 37 20 904 114 868 113 3049569 114
compcl c -b15 -s 38 29 808 102 770 100 2668128 100
dc e 13 7 903 114 890 116 3179173 119
dc e -b16300 -mt5 17 7 795 100 778 101 2773427 104
eri a 39 17 936 118 897 116 3168414 119
eri a -m3 59 21 996 125 937 122 3295385 124
eri a -m6 59 21 989 124 931 121 3272926 123
gcac a 26 12 980 123 954 124 3390603 127
gcac s 26 12 981 123 955 124 3395064 127
imp98 a -mm 31 1 1175 148 1143 148 4112387 154
imp98 a -mm -2 13 5 999 126 986 128 3533761 132
imp98 a -2 -s4 13 5 999 126 986 128 3533693 132
pkzip -es 1 1 1654 208 1652 215 5945622 223
pkzip -a 4 1 1308 164 1304 169 4691491 176
pkzip -exx 16 1 1296 163 1280 166 4605942 173
ppmdi e -o7 -m232 11 12 904 114 893 116 3169000 119
ppmdi e -o12 -m232 25 26 915 115 891 116 3113630 117
ppmdi e -o16 -m232 27 28 916 115 890 116 3100943 116
ppmn_km e -o6 -MT1 30 30 931 117 901 117 3132278 117
ppmn_km e -o8 -MT1 64 65 993 125 929 121 3107654 116
ppmn_km e -o9 62 63 990 125 929 121 3115560 117
ppmn_km e -o9 -M:50 49 50 949 119 900 117 3058436 115
ppmonstr e -o7 -m232 64 67 974 123 911 118 3035498 114
ppmonstr e -o8 -m232 71 74 980 123 910 118 3007964 113
ppmonstr e -o64 -m232 101 103 1020 128 920 119 2937387 110
ppmy /o6 /m220 240 248 1430 180 1193 155 3391576 127
ppmy /o7 /m220 273 281 1456 183 1186 154 3251362 122
ppmy /o9 /m220 323 333 1514 190 1194 155 3088212 116
qlfc a 22 11 973 122 952 124 3385084 127
rar a -m4 20 11 914 115 895 116 3178768 119
rar a -m5 25 16 920 116 896 116 3164821 119
rar a -m5 -s 25 17 927 117 902 117 3185542 119
rar a -mc16:128t -s 41 31 927 117 886 115 3077033 115
rk -mf2 50 20 1108 139 1058 137 3735704 140
rk -mx1 144 143 1147 144 1004 130 3093640 116
rk -mx2 173 173 1203 151 1032 134 3086312 116
sbc c -b63 29 9 914 115 885 115 3151930 118
sbc c -os -b63 29 9 810 102 782 101 2779632 104
slim a -d32 -w22 524 522 1862 234 1343 174 2935640 110
slim a -d16 -w22 507 512 1835 231 1333 173 2935647 110
slim a -d8 -w22 477 485 1778 224 1306 170 2935701 110
slim a -d4 -w22 483 414 1713 215 1235 160 2936380 110
szip -o4 4 10 1027 129 1023 133 3647445 137
szip -o6 17 14 996 125 979 127 3475264 130
szip -o8 -b41 27 17 973 122 947 123 3348344 125
ufa a -mg 93 104 1138 143 1046 136 3387626 127
ufa a -mg -mu32 118 129 1176 148 1059 137 3344003 125
ufa a -mg -s 247 11 1339 168 1094 142 3890454 146
uharc a -m3 -md32768 112 5 983 124 872 113 3115786 117
uharc a -mz -md32768 8 10 1093 137 1084 141 3868781 145
uharc a -mx -md32768 51 47 934 117 883 115 3009015 113
ybs -m16mu 34 9 837 105 803 104 2857446 107
ybs -m16mu -r 34 9 843 106 809 105 2878433 108
zzip a 21 11 977 123 956 124 3400243 127
zzip a -mx 22 12 973 122 952 124 3383060 127
zzip a -mx -30m 30 12 940 118 910 118 3233147 121
The Overall score is calculated by adding the compression time, the extraction
time, and the time it would take to transfer the compressed file over a
28,800 bps link (3600 bytes per second):
transfer_time = (compressed_size)/3600
The Average Users' score is calculated as (compress_time/10) + extract_time +
the time it would take to transfer the compressed file over the same 28,800 bps link.
Compression time is divided by 10 here because more than 90% of people will
never compress anything in their lives (with compression programs), yet they
use compressed data almost _every_ time they use computers and/or the Internet.
That is why compression time matters much less to them.
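
The two scores can be sketched as follows. This is a minimal illustration of the formulas as described, not the script actually used for the tests. The function names and the `divisor` parameter are assumptions introduced here, and the percentage helper assumes that 100% marks the best (smallest) value in a column.

```python
# A minimal sketch of the scoring formulas described in the text.
# Names and the "divisor" parameter are illustrative, not from the tests.

def transfer_seconds(size_bytes, bps=28_800):
    """Time to send size_bytes over a link of bps bits per second."""
    return size_bytes / (bps / 8)  # 28,800 bps = 3600 bytes per second


def overall_score(compress_s, extract_s, size_bytes):
    """Compression time + extraction time + transfer time, in seconds."""
    return compress_s + extract_s + transfer_seconds(size_bytes)


def average_users_score(compress_s, extract_s, size_bytes, divisor=10):
    """Same as overall_score, but compression time is divided by divisor."""
    return compress_s / divisor + extract_s + transfer_seconds(size_bytes)


def percent_of_best(values):
    """Each value as a rounded percentage of the smallest value."""
    best = min(values)
    return [round(100 * v / best) for v in values]


# Rows taken from the table: "no compression" and "acb B".
print(round(overall_score(0, 0, 16005619)))      # 4446, as in the table
print(round(overall_score(395, 396, 3352388)))   # 1722, as in the table
# Sizes for: no compression, compcl c -b15 -s, dc e -b16300 -mt5.
print(percent_of_best([16005619, 2668128, 2773427]))  # [600, 100, 104]
```

The tabulated times are rounded to whole seconds, so some rows differ from these formulas by a second. The tabulated Average Users' scores also appear to sit closer to a divisor of 100 than of 10 (for "acb B", a divisor of 100 gives about 1331, the listed value), which is why the divisor is left as a parameter.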
[[3]] Details
=============
Detailed results are no longer included in this main text
(1513 lines reporting 73042 results on 1231 files in 6 sets),
but can be found in the FULL version, with TEXTS.DAT and *.BAT,
at http://geocities.com/SiliconValley/Bay/1995/artest24.zip
or http://artest1.tripod.com/artest24.zip
[[4]] Comments
==============
Links to download programs:
~~~~~~~~~~~~~~~~~~~~~~~~~~~
ACE 2.04 :W http://winace.host.sk/ace204.exe 593K
ERI 5.1fre :e http://geocities.com/eri32/eri51fre.zip 89K
GCac 0.9k :W http://www.emit.jp/gca/gca_v09k.exe 295K
UHArc 0.4b :eW ftp://ftp.elf.stuba.sk/pub/pc/pack/uharc04.zip 220K
QLFC 6.6W :W http://ghido.shelter.ro/DownloadQLFC.php 49K
ZZip 0.36c :W http://www.via.ecp.fr/~damien/downloads/zzip-win32.zip 35K
PPMD var.I,
PPmonstr v.I :W ftp://ftp.elf.stuba.sk/pub/pc/pack/ppmdi1.rar 73K
PkzipC 4.00 :W ftp://ftp.pkware.com/pkzc400s.exe 3470K
PkZip 2.50 :a ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe 202K
RK-dos 1.04.1 :e http://rksoft.virtualave.net/downloads/rk104a1d.exe 461K
RK 1.04.1 :W http://rksoft.virtualave.net/downloads/rk104a1w.exe 380K
BA 1.01b5 :e http://hem.spray.se/mikael.lundqvist/ba101br5.zip 61K
BIX 1.00b7 :W http://www.7-zip.org/dl/ufa/bix100b7.zip 89K
777 0.04b1 :W http://www.7-zip.org/dl/ufa/777004b1.zip 72K
UFA 0.04b1 :W http://www.7-zip.org/dl/ufa/ufa004b1.zip 64K
ArHanGeL 1.40 :a http://geocities.com/SiliconValley/Lab/6606/arh140.zip 50K
Imp 1.1 :e http://www.technelysium.com.au/imp110d.zip 266K
Imp-win 1.12 :W http://www.technelysium.com.au/imp112.exe 122K
ACB 2.00c :e ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip 42K
BOA 0.58b :e ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip 74K
DC 0.98b :W ftp://ftp.elf.stuba.sk/pub/pc/pack/dc124.zip 55K
SZip 1.12a :W http://www.compressconsult.com/szip/szip_112a_win32.zip 71K
YBS 0.03e :e http://compression.graphicon.ru/ybs/ybs003ed.zip 56K
YBS 0.03e :W http://compression.graphicon.ru/ybs/ybs003ew.zip 44K
Slim b10 :W http://slim-fb.by.ru/slim0010.zip 22K
Bzip2 1.0.2 :W ftp://sources.redhat.com/pub/bzip2/v102/bzip2-102-x86-win32.exe
Compressia 0.98b :W http://www.compressia.com/compcl.exe
SBC_d 0.969br1 :e http://personal.inet.fi/musiikki/sjm/sbc0969b_dos.zip
SBC 0.969br1 :W http://personal.inet.fi/musiikki/sjm/sbc0969b_win32.zip
BEE 0.6.3 :W http://compression.graphicon.ru/fa/files2/bee063.rar
PPMN_km b4 :W http://compression.graphicon.ru/ms/ppmn_km.rar
PPMY 0.3c :W http://compression.graphicon.ru/sh/ppmy_3c.rar
7-Zip 2.30b23 :W http://belnet.dl.sourceforge.net/sevenzip/7z230b23.exe
WinRAR 3.00 :W http://www.rarlab.com/rar/wrar300.exe
RAR32 3.00 :e http://www.rarlab.com/rar/rarx300.exe
:a - any DOS - DOS programs, will run under pure DOS or in a DOS box
:e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI
:W - windows - Windows95/98/NT/etc programs
If a direct link doesn't work, most probably a newer version of the program has
appeared at the same site: visit the web page, or read the whole directory from
the ftp server (i.e. try the same URL, but without the filename).
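
Dropping the filename from a link can be sketched with the Python standard library. The helper name `parent_directory_url` is hypothetical, and the sketch only rewrites the URL; it does not check whether the directory can actually be listed.

```python
# A minimal sketch: turn a direct download link into its directory URL.
import posixpath
from urllib.parse import urlsplit, urlunsplit


def parent_directory_url(url):
    """Return the URL of the directory that contains the linked file."""
    parts = urlsplit(url)
    directory = posixpath.dirname(parts.path)
    if not directory.endswith("/"):
        directory += "/"
    # Query string and fragment are dropped on purpose.
    return urlunsplit((parts.scheme, parts.netloc, directory, "", ""))


print(parent_directory_url("ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip"))
# ftp://ftp.elf.stuba.sk/pub/pc/pack/
```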
Homepages:
~~~~~~~~~~
Arhangel : http://geocities.com/SiliconValley/Lab/6606
BA : http://hem.spray.se/mikael.lundqvist
Eri32 : http://geocities.com/eri32
mirror : http://artest1.tripod.com
RK : http://rksoft.virtualave.net
Imp,WinImp : http://www.technelysium.com.au/winimp.html
ACE,WinACE : http://www.winace.com
PkZip : http://www.pkware.com
RAR,WinRAR : http://www.rarlab.com
BZip2 : http://sources.redhat.com/bzip2
SZip : http://www.compressconsult.com/szip
PPMN : http://compression.graphicon.ru/ms
YBS : http://compression.graphicon.ru/ybs
SBC : http://sbcarchiver.netfirms.com
QLFC : http://ghido.shelter.ro
GCAC : http://emit.jp/gca/gca.html
PPMy : http://www.pilabs.org.ua/sh
ZZip : ? http://www.zzip.f2s.com
Compressia : http://www.compressia.com/
Slim : http://slim-fb.by.ru/
Ufa,777,
BIX,7-Zip : http://www.7-zip.org
What's new:
~~~~~~~~~~~
12 new programs were tested:
PPMd v.I
PPMonstr v.I
7-zip 2.30b22
RAR 3.00
BZip2 1.0.2
PPMN_km b4
SBC 0.969b r1
Entropy 0.7
Compressia 0.98b
Bee 0.6.3 u1
PPMy 0.3c
Slim aug17
7-zip, RAR and BZip2 perform exactly the same as their previous versions
tested in March or earlier.
The latest beta versions of Entropy, Slim and DC were available
from the authors by e-mail request:
Entropy: artest@inbox.ru
Slim: grayw@secom.lg.ua
DC: EdgarBinder@t-online.de
Results of BA, ZZip, ACB, 777, GCAC, SZip, QLFC, ArHanGeL, IMP, BICOM,
BIX, ACE are in the full version only, in the TEXTS.DAT file.
The set of Russian texts is at http://arte.nm.ru/m120
Some old or less efficient programs will not be included next time:
777, UFA, BIX, ACB, BA, PPMy.
WARNINGS:
~~~~~~~~~
BA 1.00beta5 can't correctly decompress shaks12.txt or the set used for speed
measurements.
DC 0.99.158b failed to decompress 1DFRE10.dc, ANDES10.dc, and BTI0110.dc,
saying "Corrupted block" (while the t(est) command reports "Test successful").
ERI 5.1fre can't compress files larger than (free DPMI memory)/6, i.e.
about 10 MB on a PC with 64 MB RAM. The largest file (44 MB) was split into
5 chunks of 9000000 bytes each (the last chunk was 8894190 bytes).
No problems were found in the other compressors.
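
The chunking used as a workaround for ERI above can be sketched like this. It is an illustration only: the tool actually used for splitting is not named in the text, and the function names and the numbered output-file suffix are assumptions.

```python
# A minimal sketch of splitting a large file into fixed-size chunks.
# The ".000", ".001", ... naming scheme is hypothetical.

def chunk_sizes(total_bytes, chunk_bytes):
    """Sizes of the pieces when total_bytes is cut every chunk_bytes."""
    full, rest = divmod(total_bytes, chunk_bytes)
    return [chunk_bytes] * full + ([rest] if rest else [])


def split_file(path, chunk_bytes=9_000_000):
    """Write path.000, path.001, ... and return the list of names."""
    names = []
    with open(path, "rb") as src:
        index = 0
        while True:
            data = src.read(chunk_bytes)
            if not data:
                break
            name = f"{path}.{index:03d}"
            with open(name, "wb") as dst:
                dst.write(data)
            names.append(name)
            index += 1
    return names


# Four full chunks plus the reported last chunk give the total file size.
total = 4 * 9_000_000 + 8_894_190
print(total, chunk_sizes(total, 9_000_000))
```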
The LATEST RELEASE, and all previous versions of these tests can be found
at http://geocities.com/SiliconValley/Bay/1995/ and http://artest1.tripod.com/
Send your suggestions, comments to artest@inbox.ru
With best kind regards,
A.Ratushnyak
Last updated: 20-October-2002