Technical Specifications

The items described below are the maximum technical limits of Copyleaks.

Definition of a Page
Page in Copyleaks API: We define a page in our systems as a document with up to 250 words.
For example, a document with 251 words will need 2 credits for a complete scan.
Supported File Types:
Supported Online Formats: The supported online format are html and txt (UTF8 encoded) files. Submitted by URL.
Supported Non-Textual File Types: The supported local file types are pdf , docx, doc, txt, rtf, xml, pptx, ppt, odt, chm, epub, odp, ppsx, pages, xlsx, xls and csv. You can access this list programmatically, for more info click here.
Supported Image Types (OCR): The supported image files are gif, png, bmp, jpg and jpeg. The files must contain textual content. Upload only. You can access this list programmatically, for more info click here.
Input Limits
Supported Languages: Any language supported by Unicode. More inforamtion here.
Supported OCR Languages: See full list here.
Maximum Document Length: The maximum length allowed is 2000 pages (500K words)
File Size:
Description Maximum File Size (MB)
HTML files (html, htm, ...) 5 MB
Text files (txt - UTF8 encoded) 3 MB
Non-Textual Documents (pdf, doc, docx, ...) 25 MB
Image Types (jpg, png, bmp, ...) 25 MB
Time
Time Format dd/MM/yyyy HH:mm:ss
Time Zone UTC
Default HTTP Request Timeout 110 seconds