How to Zip and Unzip Files in C#: A Complete Guide

File compression is an essential skill for any C# developer. Whether you're creating backups, reducing storage space, or preparing files for transmission, knowing how to zip and unzip files programmatically can streamline your applications.

This guide walks you through the process using C#'s built-in System.IO.Compression namespace.

Prerequisites

Before getting started, ensure you have:

  • Visual Studio or your preferred C# IDE
  • .NET Framework 4.5 or later
  • Basic understanding of C# file operations

Creating Zip Files in C#

The System.IO.Compression namespace provides the ZipFile and ZipArchive classes for handling zip operations. Here's how to create a zip file:

using System.IO.Compression;

// Create a zip file from a directory
ZipFile.CreateFromDirectory(@"C:\SourceFolder", @"C:\output.zip");

// Create a zip file with custom settings
using (var zipArchive = ZipFile.Open(@"C:\custom.zip", ZipArchiveMode.Create))
{
    zipArchive.CreateEntryFromFile(@"C:\file1.txt", "file1.txt");
    zipArchive.CreateEntryFromFile(@"C:\file2.pdf", "file2.pdf");
}

Extracting Zip Files

Unzipping files is just as straightforward:

// Extract all files to a directory
ZipFile.ExtractToDirectory(@"C:\archive.zip", @"C:\ExtractedFolder");

// Extract specific files
using (var archive = ZipFile.OpenRead(@"C:\archive.zip"))
{
    foreach (var entry in archive.Entries)
    {
        if (entry.Name.EndsWith(".txt"))
        {
            entry.ExtractToFile(Path.Combine(@"C:\ExtractedFolder", entry.Name));
        }
    }
}

Best Practices and Tips

  1. Always use 'using' statements when working with ZipArchive objects to ensure proper resource disposal.
  2. Handle exceptions appropriately, as file operations can fail due to permissions or file access issues.
  3. Check available disk space before extracting large zip files.
  4. Consider using compression levels for optimal file size versus speed trade-offs.

Advanced Features

The System.IO.Compression namespace offers additional features:

// Set compression level
using (var archive = ZipFile.Open(@"C:\compressed.zip", ZipArchiveMode.Create))
{
    archive.CreateEntryFromFile(@"C:\largefile.dat", "largefile.dat", CompressionLevel.Optimal);
}

// Update existing zip files
using (var archive = ZipFile.Open(@"C:\existing.zip", ZipArchiveMode.Update))
{
    archive.CreateEntryFromFile(@"C:\newfile.txt", "newfile.txt");
}

Common Issues and Solutions

  • File Access Errors: Ensure files aren't in use by other processes before zipping/unzipping.
  • Path Too Long: Use shorter file paths or enable long path support in Windows.
  • Out of Memory: Process large files in chunks rather than loading entirely into memory.

Conclusion

Mastering zip operations in C# enables you to create more efficient applications that handle file compression seamlessly. The System.IO.Compression namespace provides all the tools needed for basic to advanced zip operations, making it easy to implement file compression in your C# projects.

Remember to always test your zip operations thoroughly and implement proper error handling to ensure robust file compression functionality in your applications.

5
699

Related

Closing a SqlDataReader correctly prevents memory leaks, connection issues, and unclosed resources. Here’s the best way to do it.

Use 'using' to Auto-Close

Using using statements ensures SqlDataReader and SqlConnection are closed even if an exception occurs.

Example

using (SqlConnection conn = new SqlConnection(connectionString))
{
    conn.Open();
    using (SqlCommand cmd = new SqlCommand("SELECT * FROM Users", conn))
    using (SqlDataReader reader = cmd.ExecuteReader())
    {
        while (reader.Read())
        {
            Console.WriteLine(reader["Username"]);
        }
    } // ✅ Auto-closes reader here
} // ✅ Auto-closes connection here

This approach auto-closes resources when done and it is cleaner and less error-prone than manual closing.

⚡ Alternative: Manually Close in finally Block

If you need explicit control, you can manually close it inside a finally block.

SqlDataReader? reader = null;
try
{
    using SqlConnection conn = new SqlConnection(connectionString);
    conn.Open();
    using SqlCommand cmd = new SqlCommand("SELECT * FROM Users", conn);
    reader = cmd.ExecuteReader();

    while (reader.Read())
    {
        Console.WriteLine(reader["Username"]);
    }
}
finally
{
    reader?.Close();  // ✅ Closes reader if it was opened
}

This is slightly more error prone if you forget to add a finally block. But might make sense when you need to handle the reader separately from the command or connection.

1
168

Raw string literals in C# provide a flexible way to work with multiline strings, with some interesting rules around how quotes work.

The key insight is that you can use any number of double quotes (three or more) to delimit your string, as long as the opening and closing sequences have the same number of quotes.

The Basic Rules

  1. You must use at least three double quotes (""") to start and end a raw string literal
  2. The opening and closing quotes must have the same count
  3. The closing quotes must be on their own line for proper indentation
  4. If your string content contains a sequence of double quotes, you need to use more quotes in your delimiter than the longest sequence in your content

Examples with Different Quote Counts

// Three quotes - most common usage
string basic = """
    This is a basic
    multiline string
    """;

// Four quotes - when your content has three quotes
string withThreeQuotes = """"
    Here's some text with """quoted""" content
    """";

// Five quotes - when your content has four quotes
string withFourQuotes = """""
    Here's text with """"nested"""" quotes
    """"";

// Six quotes - for even more complex scenarios
string withFiveQuotes = """"""
    Look at these """""nested""""" quotes!
    """""";

The N+1 Rule

The general rule is that if your string content contains N consecutive double quotes, you need to wrap the entire string with at least N+1 quotes. This ensures the compiler can properly distinguish between your content and the string's delimiters.

// Example demonstrating the N+1 rule
string example1 = """
    No quotes inside
    """; // 3 quotes is fine

string example2 = """"
    Contains """three quotes"""
    """"; // Needs 4 quotes (3+1)

string example3 = """""
    Has """"four quotes""""
    """""; // Needs 5 quotes (4+1)

Practical Tips

  • Start with three quotes (""") as your default
  • Only increase the quote count when you actually need to embed quote sequences in your content
  • The closing quotes must be on their own line and should line up with the indentation you want
  • Any whitespace to the left of the closing quotes defines the baseline indentation
// Indentation example
string properlyIndented = """
    {
        "property": "value",
        "nested": {
            "deeper": "content"
        }
    }
    """; // This line's position determines the indentation

This flexibility with quote counts makes raw string literals extremely versatile, especially when dealing with content that itself contains quotes, like JSON, XML, or other structured text formats.

2
75

Removing duplicates from a list in C# is a common task, especially when working with large datasets. C# provides multiple ways to achieve this efficiently, leveraging built-in collections and LINQ.

Using HashSet (Fastest for Unique Elements)

A HashSet<T> automatically removes duplicates since it only stores unique values. This is one of the fastest methods:

List<int> numbers = new List<int> { 1, 2, 2, 3, 4, 4, 5 };
numbers = new HashSet<int>(numbers).ToList();
Console.WriteLine(string.Join(", ", numbers)); // Output: 1, 2, 3, 4, 5

Using LINQ Distinct (Concise and Readable)

LINQ’s Distinct() method provides an elegant way to remove duplicates:

List<int> numbers = new List<int> { 1, 2, 2, 3, 4, 4, 5 };
numbers = numbers.Distinct().ToList();
Console.WriteLine(string.Join(", ", numbers)); // Output: 1, 2, 3, 4, 5

Removing Duplicates by Custom Property (For Complex Objects)

When working with objects, DistinctBy() from .NET 6+ simplifies duplicate removal based on a property:

using System.Linq;
using System.Collections.Generic;

class Person
{
    public string Name { get; set; }
    public int Age { get; set; }
}

List<Person> people = new List<Person>
{
    new Person { Name = "Alice", Age = 30 },
    new Person { Name = "Bob", Age = 25 },
    new Person { Name = "Alice", Age = 30 }
};

people = people.DistinctBy(p => p.Name).ToList();
Console.WriteLine(string.Join(", ", people.Select(p => p.Name))); // Output: Alice, Bob

For earlier .NET versions, use GroupBy():

people = people.GroupBy(p => p.Name).Select(g => g.First()).ToList();

Performance Considerations

  • HashSet<T> is the fastest but only works for simple types.
  • Distinct() is easy to use but slower than HashSet<T> for large lists.
  • DistinctBy() (or GroupBy()) is useful for complex objects but may have performance trade-offs.

Conclusion

Choosing the best approach depends on the data type and use case. HashSet<T> is ideal for primitive types, Distinct() is simple and readable, and DistinctBy() (or GroupBy()) is effective for objects.

1
338