What is Papercuts?
Papercuts /parse/
API allows you to extract structured data from a wide range of unstructured file formats using a unified endpoint.
It supports OCR, audio transcription, and LLM-based parsing to convert files like PDFs, images, or audio recordings into structured JSON based on a template JSON schema you provide.
This output can be directly consumed by downstream systems such as:
By aligning the parsed result to your custom schema, the Parse API ensures interoperability with your backend systems and AI workflows.
.pdf
, .docx
, .pptx
.png
, .jpg
, .jpeg
, .bmp
, .tiff
.mp3
, .wav
, .ogg
, .m4a
.mp4
, .mov
, .avi
, .mkv
Note: File size must be less than 32MB.
All requests must include an x-api-key
header with a valid API key.
Papercuts is currently in closed beta. To get access, contact us at aditya@papercuts.ai.
Send a POST
request to /parse/
with:
file
: The input file to be parsed (as multipart/form-data
)template_json
: A JSON string describing the expected output schemaStart using the API