Creating Text-Based PDFs

A variety of accessible formats of text options are available to students with disabilities, including audio, large print, and braille. Most students at CC who are approved for this accommodation use text-to-speech software to create an audio version of the text materials. To do this, students need access to documents that are text-based (OCR or Optical Character Recognition enabled) rather than image-based. Individual words and letters can be highlighted on a text-based document. Image-based text appears as a square block when the text is highlighted. Faculty can help to support an inclusive academic experience by providing students with text-based documents that offer immediate access to course materials for all students. Accessible PDFs can be created easily by scanning documents on one of the college’s HP brand copier/scanners. Instructions are provided below.

What is a high-quality scanned PDF? 

  • The text is not cut off.
  • The pages are straight, not crooked or curved.
  • No shadows are covering the print.
  • Handwriting, highlighting, annotations, circles, or underlining is limited and do not cover any text.
  • There is good color contrast throughout the page.
  • The scan is clear with no blurry text.
  • The text is highlightable indicating that it is text based.

Tips & Tricks

  • When scanning a book with a spine, push the spine of the book down while scanning to ensure the text will copy clearly.
  • Consider using an overhead scanner for higher quality files. An overhead scanner is available in Tutt Library. 
  • Scan one page at a time, and do not combine pages. This improves reading on mobile platforms and speeds up the conversion process for optical character recognition (OCR) software.
  • Avoid scanned documents that need to be rotated when viewed on a screen.
  • Document names should correspond to how documents are named on the syllabus and in Canvas.
  • Avoid rescanning old documents, as the quality will deteriorate with each scan. Use the original source when possible. 

Instructions for Scanning

HP brand machines on campus have the option to scan to OCR. To do so, follow either of these two methods: 

  • “Quick Scan” with Papercut Hive​ 
    • Enter your Papercut Hive access code​ 
    • Click “quick scan”​ 
    • Automatically scans as an OCR document 

 OR

  • Scanning to email​ 
    • Select “Scan” > “Scan to Email” 
    •  Enable OCR under “Options” > “File Type & Resolution.” Under “File Type” select “Searchable PDF (OCR)” 

Example of a high-quality scan (Click to view full OCR document): This example shows a high quality scan from a page in a book that is clear and easy to read

Additional information about text-based documents and text-to-speech can be found on the assistive technology page. 

Table shows images with examples of low quality scans and an explanation for why each image would not be easily accessible for students using text to speech.

There are two images showing examples of a scan that is too faded and the contrast is too light

The text is faded and the contrast is very light in both examples.

 

The image shows an example of a scanned page from a book with a black margin that blocks out any text making it impossible to read

This scan has a darkened margin and shadows the text making it illegible. 

The image shows a scan of a book with two pages, but one page is cut off.

Part of a cut-off page has been included in the scan which can slow down the conversion process.

This image shows a scanned book with a crooked page, making it difficult to use text-to-speech software

The page is crooked.

This image shows a scanned page from a book with lots of handwritten underlines and notes making it hard to read the text

There is too much underlining and it goes over some of the text.

This image shows a scanned book with 2 pages side-by-side and rotated 90%

The scan has heavy annotations, is 2 side-by-side pages, and is rotated.

 

show all / hide all

Report an issue - Last updated: 03/13/2025