There are cases where you have a compressed GZIP file for which you want to determine the uncompressed data size without having to extract it.
For example, if you work with large text-based documents, you can either display their content directly in the browser or share it as a file upon request depending on the file size.
Luckily for us, the GZIP file format specification includes the following statement:
+=======================+ |...compressed blocks...| (more-->) +=======================+ 0 1 2 3 4 5 6 7 +---+---+---+---+---+---+---+---+ | CRC32 | ISIZE | +---+---+---+---+---+---+---+---+ ISIZE (Input SIZE) This contains the size of the original (uncompressed) input data modulo 2^32.
It means that as long as the uncompressed payload is less than 4GB, the ISIZE value will represent the uncompressed data size.
You can get it in Ruby by combining #seek
, #read
and #unpack1
as followed:
# Open file for reading file = File.open('data.gzip') # Move to the end to obtain only the ISIZE data file.seek(-4, 2) # Read the needed data and decode it to unsigned int size = file.read(4).unpack1('I') # Close the file after reading file.close # Print the result puts "Your uncompressed data size: #{size} bytes"
Cover photo by Daniel Go on Attribution-NonCommercial 2.0 Generic (CC BY-NC 2.0). Image has been cropped to 766x450px.
The post Reading the uncompressed GZIP file size in Ruby without decompression appeared first on Closer to Code.
Data centers are popping up everywhere. With the rapid growth of AI, cloud services, streaming…
Our commitment to building a thriving open source community is stronger than ever. We believe…
The clock was ticking: Node.js 18’s upstream End of Life (EOL) The OpenJS Foundation is…
June 25th, 2025 – Canonical, the company behind Ubuntu, and Pure Storage, the IT pioneer…
Co-authored with Julie Muzina A year ago, during our Madrid Engineering Sprint, we challenged ourselves…
Welcome to the Ubuntu Weekly Newsletter, Issue 897 for the week of June 15 –…