Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6980799
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 27, 20262026-05-27T18:04:24+00:00 2026-05-27T18:04:24+00:00

Tools exist to provide random access to gzip and bzip2 archives: gzip zran.c from

  • 0

Tools exist to provide random access to gzip and bzip2 archives:

  • gzip zran.c from the ghostscript source code
  • bzip2 seek-bzip by James Taylor

I’m looking for any similar solution for 7zip

(The goal is to utilize the sometimes gigantic Wikipedia dump files offline without keeping decompressed copies around)

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-27T18:04:25+00:00Added an answer on May 27, 2026 at 6:04 pm

    I tought it’s better to summarize GZIP, BZIP2 and LZMA internals to make clear something:

    1. GZIP is actually a format which uses Deflate algorithm. Due to static huffman codes (deflate documents also mentions about dynamic huffman, but actually they are static too) deflate should be encoded as block-wise (sliding window is another term in here). zran.c seems to find those blocks boundaries and tries to decode up to 2 consecutive blocks at most which could contain a few KiB uncompressed data for collecting enough data to decompress (to fill entire 32 KiB window). So, random access is quite possible even without index table.

    2. BZIP2 is actually a BWT class compression algorithm. And due to BWT nature, it’s no wonder that it’s block-wise. It’s blocks limited up to 900 KiB for each individual blocks. Also, blocks boundaries are well defined to easy recovering process (has huge distinct markers). So, you can even use multiple threads at once to decompress all data. In another words, random access is quite possible even without any table (it’s already supported by format).

    3. LZMA supports up to 1 GiB dictionary and it’s not block-wise encoded. It uses a range coder to encode probabilities rather than huffman coder. Even considering 64 MiB window size (very common value), due to range coder nature we can’t simply decode at a given random point until fill-up entire window. Also, LZMA’s state machine can be bothersome too. So, it’s implementation is quite hard or even impossible.

    Maybe LZMA2 or PPM methods can be used for such usages (7-zip supports them as well within 7-zip format). PPM flushes it’s model when it’s statistics are full and LZMA2 intentionally flushes some state at some interval to enable multi-threaded decompression. Their random access implementation can be possible.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

There exist tools for comparing code against a custom specified set of coding guidelines/standards
Are there tools that make the job easier? If command-line only tools exist, then
Why don't databases automatically index tables based on query frequency? Do any tools exist
What kind of tools or techniques exist for this purpose?
What kind of tools and techniques exist for this purpose?
I have a website made to provide free web-based tools for making indie games.
Background There are many tools to analyze logfiles from servers like Apache HTTP server,
I imported the Software PAD File Database from http://paddatabase.net/download.html into Microsoft Access in a
Does a tool exist for dynamically altering running javascript in a browser? For example,
Does an auto-formatting tool exist for vi that'll allow me to define per language

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.