Loading...
Reference

Universal Document Extractor

Overview

Heavstal Universal Extractor is a multi-format parsing engine. It accepts a direct URL to a document or code file and returns the clean, raw text content. This is ideal for feeding documents into LLMs or search indexes.

Supported Formats

  • PDF (.pdf)
  • Word (.docx)
  • Plain Text (.txt)
  • Code (.js, .ts, .py, .java, .html, .css, .json, .md, etc.)
  • Endpoint

    POST /doc-extract

    Request Parameters

    FieldTypeRequiredDescription
    urlstringRequiredDirect URL to the file.

    Example Usage

    javascript

    Successful Response

    json