Compressing packages with bzip2 instead gzip?
Phillip Susi
psusi at cfl.rr.com
Thu Jan 19 03:23:48 GMT 2006
John C. McCabe-Dansted wrote:
<snip>
>Take for example tetex...3.3 -> tetex...3.4:
>
> Adding the --rsyncable flag did not help, with lookinside it required 30.8%
>of the file either way. Using --rsyncable without lookinside (using the -Z
>flag to generate the .zsync file), we had to download 43.6% of the file.
>Using -Z without --rsyncable we had to download 87.6% of the file.
>
>
Are you sure that zsync was looking inside the compressed stream? Did
you do this with the .deb or the .tar.gz? It looks like zsync can look
inside a .gz if you tell it to, but I doubt it is smart enough to look
inside the .deb, find the two gzip streams, then open those up, which is
what you want it to do.
>To get downloads in the range 5%~10%, you basically have to use bsdiff.
>
>
>
Right, indivudal binary diffs between two versions will be smaller than
the amount of data rsync or zsync will need to send to change between
those two versions, BUT you must keep a diff between each and every
version, and clients can not skip a version; they must apply each
subsequent patch to the old version they have to get up to date. With
zsync you don't have to store/download diffs between every version. The
clients also must still have the original unmodified deb, which they
usually don't. With zsync, they could approximately rebuild the
original deb from the files installed on their system ( conf files and
whatnot may be different ), then zsync it up to date.
<snip>
>The bsdiff approach has two main advantages over zsync:
>- about four times less bandwidth required.
>- works with compression other than gzip.
>
>
>
It only works with non gzip compressors if you decompress first, then
compute the bsdiff against the uncompressed data.
More information about the ubuntu-devel
mailing list