Bzip2

Bzip2

Infobox file format
name = bzip2
extension = .bz2
mime = application/x-bzip
owner = Julian Seward
type code = Bzp2
magic = BZh
genre = Data compression
container for =
contained by =
extended from =
extended to =

Infobox_Software
name = bzip2



caption =
developer = Julian Seward
latest_release_version = 1.0.5
latest_release_date = March 17, 2008
operating_system = Cross-platform
genre = data compression
license = BSD licencecite web|publisher=Julian Seward|url=http://www.bzip.org/|title=bzip2 : Home|accessdate=2008-09-27|quote=Why would I want to use it? [..] Because it's open-source (BSD-style license), and, as far as I know, patent-free.]
website = [http://bzip.org/ bzip.org]

bzip2 is a free and open source lossless data compression algorithm and program developed by Julian Seward. Seward made the first public release of bzip2, version 0.15, in July 1996. The compressor's stability and popularity grew over the next several years, and Seward released version 1.0 in late 2000.

Compression efficiency

bzip2 compresses most files more effectively than more traditional gzip or ZIP but is slowerFact|date=August 2008. In this manner it is fairly similar to other recent-generation compression algorithms. Unlike other formats such as RAR or ZIP (but similar to gzip), bzip2 is only a data compressor, not an archiver. The program itself has no facilities for multiple files, encryption or archive-splitting, but, in the UNIX tradition, relies instead on separate external utilities such as tar and GnuPG for these tasks.

In most cases bzip2 is surpassed by PPM algorithms in terms of absolute compression efficiency. bzip2 gets within ten to fifteen percent of PPM, while being roughly twice as fast at compression and six times faster at decompression. [ [http://www.bzip.org/ bzip2 Web site] ]

bzip2 uses the Burrows-Wheeler transform to convert frequently recurring character sequences into strings of identical letters, and then applies a move-to-front transform and finally Huffman coding. In bzip2 the blocks are generally all the same size in plaintext, which can be selected by a command-line argument between 100 kB–900 kB. Compression blocks are delimited by a 48-bit sequence (magic number) derived from the binary-coded decimal representation of π, 0x314159265359, with the end-of-stream similarly delimited by a value representing sqrt(π), 0x177245385090.

Originally, bzip2's ancestor bzip used arithmetic coding after the blocksort; this was discontinued because of the patent restriction to be replaced by the Huffman coding currently used in bzip2Fact|date=August 2008.

bzip2 is known to be quite slow at compressing, leading users to opt for alternatives such as gzip when time is an issue. This problem is asymmetric, as decompression is relatively fast. Motivated by the large CPU time required for compression, a modified version was created in 2003 that supported multi-threading, giving significant speed improvements on multi-cpu and multi-core computersFact|date=August 2008. As of January 2008 this functionality has not been incorporated into the main project.

Compression stack

Bzip2 uses several layers of compression techniques stacked on top of each other, which occur in the following order during compression and the reverse order during decompression:
# Run-length encoding (RLE): any sequence of 4 to 255 consecutive duplicate symbols is replaced by the first four symbols and a repeat length between 0 and 251. Thus the sequence "AAAAAAABBBBCCCD" is replaced with "AAAA3BBBB


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • Bzip2 — бесплатная свободная утилита командной строки (а также алгоритм) с открытым исходным кодом для сжатия данных. Разработана и впервые опубликована Джулианом Сьюардом (англ. Julian Seward) в июле 1996 (версия 0.15). Стабильность и популярность… …   Википедия

  • bzip2 — Расширение .bz2 MIME application/x bzip Сигнатура BZh Разработан Джулиан Сюард Тип формата Сжатие данных Открытый формат? …   Википедия

  • BZIP2 — Entwickler: Julian Seward Aktuelle Version: 1.0.5 (17. März 2008) Betriebssystem …   Deutsch Wikipedia

  • BZip2 — Entwickler: Julian Seward Aktuelle Version: 1.0.5 (17. März 2008) Betriebssystem …   Deutsch Wikipedia

  • Bzip2 — Entwickler: Julian Seward Aktuelle Version: 1.0.5 (17. März 2008) Betriebssystem …   Deutsch Wikipedia

  • bzip2 — Entwickler Julian Seward Aktuelle Version 1.0.6 (20. September 2010) Betriebssystem …   Deutsch Wikipedia

  • Bzip2 — Saltar a navegación, búsqueda bzip2 Desarrollador Julian Seward Información general Extensión de archivo .bz2, .tar.bz2, .tbz2,.tb2 …   Wikipedia Español

  • Bzip2 — Extension de fichier .bz2, .tar.bz2, .tbz2, .tb2 Type MIME application/x bzip Développé par Julian Seward Type de format Compression de données Spécification …   Wikipédia en Français

  • bzip2 — Extension .bz2, .tar.bz2, .tbz2, .tb2 Type MIME application/x bzip Développé par Julian Seward Type de format Compression de données Spécification Format ouvert …   Wikipédia en Français

  • Bzip2 — se trata de un programa de código abierto desarrollado bajo licencia del estilo BSD que comprime y descomprime ficheros usando los algoritmos de compresión de Burrows Wheeler y de codificación de Huffman. El porcentaje de compresión alcanzado… …   Enciclopedia Universal

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”