Extract Text from PDF document
Extract Text method of the Extractions endpoint lets you pull the text content from a PDF so you can reuse it in your application. You can request the full document or limit extraction to specific pages using the pageIndices parameter.
readingOrder boolean parameter.
Accept header (details below), which defaults to application/json.
Prefer header (details below).
uploadResultTo or uploadResultsTo properties.
uploadResultTo or uploadResultsTo, to upload your file(s) once the Job is done processing.
JSON (application/json) | Binary (application/octet-stream) | |
|---|---|---|
| Synchronous | Returns a JSON object with a file(s) URL(s). | Returns the processed file directly as binary. |
Asynchronous (respond-async) | Returns a JSON object with a Job ID and status. | Async preference ignored, returns sync binary. |
| Synchronous delivery | File delivered to custom endpoint / Bucket, returns success confirmation. | N/A |
| Asynchronous delivery | Returns a JSON object with a Job ID and status. The file(s) will be uploaded to the provided PUT endpoint / S3 Bucket at the end of process. If callback url is provided, Nitro will notify the endpoint with a JOB ID and location. | N/A |
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Controls response format and behavior. See endpoint description above for detailed response combinations.
application/json: Returns JSON response with operation resultapplication/octet-stream: Returns binary file content*/*: Defaults to JSON responseapplication/json, application/octet-stream, */* Controls synchronous vs asynchronous operation. See endpoint description above for behavior details.
respond-async: Makes request asynchronous, returns job status for pollingrespond-async The Extractions' endpoint method: extract-text
extract-text The file to process. It can be provided as a binary upload or as a JSON remote file reference.
This endpoint lets you supply your own URL to receive the single-file output. The URL may point to a custom API endpoint or a pre-signed S3 URL.
The HTTP method defaults to PUT, but you can change it based on your implementation needs via the verb parameter. You can also provide custom headers, such as authentication headers or any others required by your endpoint.
Returns either JSON or binary output depending on the Accept header (defaults to JSON). JSON responses include a file URL for synchronous tasks or a job status for asynchronous tasks.