What is gz file

Last updated: April 1, 2026

Quick Answer: A .gz file is a compressed data file created using gzip compression, containing one or more files reduced in size through lossless compression algorithms.

Key Facts

Understanding .GZ Files

A .gz file is a compressed file created using the gzip compression standard. These files are commonly found in software distribution, backup systems, and web server transmissions. The .gz extension indicates that gzip compression has been applied to reduce the file size while preserving all original data. Unlike lossy compression used for images and videos, gzip is lossless, meaning decompressed files are identical to originals.

File Structure and Format

The .gz format includes a header containing metadata, the compressed data stream, and a footer with checksum information. Each .gz file begins with a magic number (1f 8b in hexadecimal), allowing systems to identify the file type. The format supports optional original filename, modification time, and comment storage. This metadata helps users track compressed file information without decompression.

Creating and Extracting .GZ Files

On Unix/Linux systems, the 'gzip' command creates .gz files: 'gzip filename' compresses the file and replaces the original. Decompression uses 'gunzip filename.gz' or 'gzip -d filename.gz'. Many systems also support the 'zcat' command to view compressed file contents without extraction. On Windows, tools like 7-Zip, WinRAR, and Windows 10+ provide native .gz support through file managers.

.TAR.GZ Archives

The .tar.gz format combines tar archiving (bundling multiple files) with gzip compression, creating a single compressed file containing entire directory structures. This is the standard format for distributing source code and software on Unix/Linux systems. Creating a tar.gz file uses 'tar -czf archive.tar.gz directory/', while extraction uses 'tar -xzf archive.tar.gz'. The .tgz extension is a shorthand for .tar.gz files.

Web and Server Usage

Modern web servers automatically compress HTML, CSS, and JavaScript files with gzip before transmission. Browsers decompress these files instantly, reducing bandwidth by 60-80%. Server logs are commonly compressed to .gz format for long-term storage. This compression is transparent to users, happening automatically during web browsing and downloads.

Related Questions

How much space does a .gz file save?

Compression ratios vary by file type: text files typically compress to 10-30% of original size, while already-compressed formats like images and videos show minimal savings. Typical web files compress by 70-80%.

Can .gz files be corrupted?

Yes, .gz files can be corrupted like any digital file. Gzip includes checksum verification to detect corruption. Partially downloaded or damaged .gz files usually fail during decompression, preventing accidental use of corrupt data.

What is the difference between .gz and .zip?

.gz compresses single files while .zip archives and compresses multiple files into one. .zip is more common on Windows, .gz on Linux. .zip files can be partially extracted; .gz requires full decompression.

Sources

  1. Wikipedia - Gzip CC-BY-SA-4.0
  2. GNU Gzip Manual GFDL