iformat.io Logo iformat.io

Scanned Documents Too Large? How to Compress Without Losing Readability

P
Mar 13, 2026
3 min read
You scanned your Aadhaar card, marksheets, and address proof using a phone scanner app. Each page came out as a 3 MB PDF. You need to upload all of them to a government portal that accepts a maximum of 500 KB per document. The numbers don't add up, and you're starting to think you need a different scanner. You don't — you need a different approach to file size.

Why Scans Are So Large

When you scan a document (or photograph it with a scanner app), the result is essentially a photograph of a page. A full-page image at 300 DPI is roughly 2480 x 3508 pixels — that's an 8.7 megapixel image per page. Even with JPG compression, each page ends up at 1-3 MB. A 10-page document becomes 10-30 MB. For comparison, the same text content typed into a Word document would be about 50 KB.

The Fast Fix: Compress the PDF

A PDF compressor reduces the resolution of embedded images while keeping the overall document structure intact. A scanned page at 300 DPI compressed to 150 DPI cuts the file size roughly in half. For most on-screen viewing and upload purposes, 150 DPI is perfectly readable — you only need 300 DPI for printing.
If the portal needs files under 200 KB, you might need more aggressive compression. At 100 DPI, text is still readable but fine details (signatures, small print) start to soften. Test by zooming in on the compressed version — if you can read all the important parts, it's good enough.

Better Approach: Scan Smarter

If you haven't scanned yet, or can rescan, adjust your scanner settings before scanning. Set resolution to 150-200 DPI instead of the default 300. Choose grayscale instead of colour for text documents (colour triples the file size for no benefit on black-and-white documents). Use the JPG format at 85% quality instead of PNG or TIFF.
Phone scanner apps like CamScanner, Adobe Scan, and Microsoft Lens have "file size" or "quality" settings buried in their settings menus. Turning quality from "High" to "Medium" typically reduces output by 60% with no visible difference in text readability.

For Text Documents: OCR Changes Everything

If your scanned document is mostly text (certificates, letters, marksheets), OCR (Optical Character Recognition) can convert the page images into actual text. An image-to-text tool extracts the text content, which you can then paste into a Word document and save as PDF. A 3 MB scanned certificate becomes a 100 KB text-based PDF.
OCR works best on clearly printed text with good contrast. Handwritten content, faded prints, or low-contrast scans may not convert accurately. For government ID documents where you need an exact visual copy (like Aadhaar or PAN), compression is usually better than OCR.

Multi-Page Scans and Merging

Government portals and university applications often require multiple documents (ID proof, address proof, marksheets) combined into a single PDF. Scanning each document separately and merging them into one file is cleaner than scanning everything in one go. Scan each document, compress each individual PDF to the target size, then merge them using a PDF merge tool. This gives you more control over the quality of each section and keeps the final file size predictable.
When merging, check the total file size of the combined document. If the portal has a strict limit (say 2 MB for all documents combined), divide the budget across your pages. Four scanned pages at 150 DPI in JPG-compressed PDF format typically total under 1 MB — well within most upload limits.

Quick Reference

Quick fix: Compress the scanned PDF to reduce embedded image resolution. Need under 500 KB? Compress to 150 DPI — still readable on screen. Need under 200 KB? Compress to 100 DPI and use grayscale. Haven't scanned yet? Scan at 150-200 DPI in grayscale JPG. Text documents? Use OCR to convert image to text — dramatically smaller files. Golden rule: Scan once at high quality, then compress copies for different upload requirements.
Browse All Posts