You've been there: a colleague sends a 45MB PDF that needs to be uploaded to a portal with a 10MB limit. Or you're attaching a proposal to an email and Gmail bounces it back. PDF compression is one of the most common file management tasks, yet most people don't understand why PDFs get large or how to reduce their size without making the content unreadable.
Why Do PDFs Get So Large?
PDFs are often much larger than they need to be. The culprits fall into a few categories:
- Embedded images: This is the #1 cause. A 12-megapixel photo from a smartphone (typically 3–6MB as JPEG) embedded uncompressed in a PDF can balloon to 20–30MB. Multi-page documents with several images compound this quickly
- Unoptimized fonts: Embedding a full font file (which can be 500KB–2MB for CJK fonts) when you only use a subset of characters. A Chinese document using 200 unique characters doesn't need the entire 5MB SimSun font embedded
- Duplicate data: Some PDF generators (especially older versions of Word and Acrobat) embed the same image or font data multiple times without deduplication
- Unnecessary metadata: Author info, creation dates, thumbnails, and edit histories can add hundreds of kilobytes
- Vector graphics complexity: SVGs or Illustrator exports with thousands of path points create large PDFs even without raster images
Compression Techniques Explained
Image Compression
For image-heavy PDFs, this is where you'll see the biggest savings. The key settings are:
- Resolution downsampling: Screen viewing only needs 72–150 DPI. Print-quality needs 300 DPI. If the PDF will only be viewed on screen, downsampling from 300 DPI to 150 DPI can cut image sizes by 75%
- JPEG quality: 60–80% quality is virtually indistinguishable from 100% for photographs. Going below 50% introduces visible artifacts
- Format conversion: PNG is lossless but large for photos. Converting PNGs to JPEGs inside the PDF saves significant space for photographic content
Font Optimization
Font subsetting — embedding only the characters actually used — is the most effective technique. A document using the Open Sans font with 300 unique characters needs about 15KB of font data instead of the full 200KB+ font file. Most modern PDF tools do this automatically, but older tools and some scanner software don't.
Content Stream Compression
PDF supports internal compression (FlateDecode, similar to ZIP). Most well-formed PDFs already use this. If your PDF isn't using stream compression, enabling it alone can reduce size by 30–50% with zero quality loss.
Real-World Scenarios
Email Attachments
Gmail and most email providers have a 25MB attachment limit. A 30-page report with embedded screenshots often exceeds this. For email purposes, aim for 72–96 DPI image resolution and 60–70% JPEG quality. The text remains perfectly readable; only very fine details in images degrade slightly.
Web Uploads and Forms
Government portals, job applications, and banking sites often have strict 2–5MB limits. These usually accept lower quality because they're primarily checking document content, not image fidelity. Aggressive compression (150 DPI, 50% quality) is appropriate here.
Archiving
For long-term storage where you need to preserve quality, use lossless compression only (FlateDecode). Remove metadata and duplicate resources, but don't touch image quality. The savings will be modest (10–30%) but your document stays pixel-perfect.
Print Production
If the PDF is going to a commercial printer, don't compress. In fact, you may need to increase resolution and convert images to CMYK. Print production has different requirements entirely — file size is less important than color accuracy and resolution.
What to Look For in a PDF Compression Tool
- Client-side processing: Your documents may contain sensitive information — financial statements, contracts, medical records. Tools that process files in your browser (like Risetop's PDF compressor) keep your data private
- Quality control: The best tools let you choose compression level rather than applying a one-size-fits-all approach
- Batch processing: If you regularly compress PDFs, batch support saves significant time
- No file size limits: Free tools often cap uploads at 5–10MB — which defeats the purpose when your file is exactly the one that needs compressing
Before and After: Typical Compression Results
- Text-only PDF (20 pages): 500KB → 200KB (60% reduction via stream compression)
- Report with charts (30 pages): 15MB → 3MB (80% reduction via image downsampling)
- Scanned document (50 pages): 120MB → 12MB (90% reduction via aggressive image compression)
- Photo portfolio PDF (10 pages): 45MB → 8MB (82% reduction via JPEG conversion + downsampling)
Results vary based on content, but image-heavy documents consistently see the most dramatic savings.
Conclusion
PDF compression isn't magic — it's a combination of well-understood techniques applied to the right parts of your document. The key is knowing your use case: email and web uploads call for aggressive image compression, while archival and print require gentler approaches. Understanding what's making your PDF large (almost always images) lets you target the right solution rather than blindly applying maximum compression to everything.