Skip to main content

ONIX Intake

What is ONIX?

ONIX (ONline Information eXchange) is a standardized XML format for sharing book metadata between publishers, distributors, and retailers. It contains essential information about publications including titles, authors, ISBNs, pricing, and availability.

What are ONIX Intakes?

ONIX Intakes are automated processes that receive, process, and manage ONIX files from content providers like Bookwire or Ingram. These intakes ensure that book metadata in the Publica.la platform stays current and accurate.

Benefits of ONIX Intakes

  • Automated Updates: Catalog information is automatically refreshed
  • Data Consistency: Ensures metadata accuracy across the platform
  • Efficient Processing: Handles large volumes of publication data

How ONIX Intake Works

  1. File Reception: Content providers send ONIX files to our SFTP server, which connects to an AWS S3 bucket. This bucket stores both the ONIX files and the actual content files (EPUB, PDF, MP3).

  2. Processing Pipeline: Once the ONIX files arrive in S3, they are processed in several stages:

    • File Discovery: The received ONIX files are identified and logged.
    • Event Extraction: Each ONIX file contains multiple product blocks or "events." Each of these events describes the metadata of a specific book.
    • Publication Updates: The events are used to create or update publications in the Publica.la database, ensuring that the latest data is reflected on the platform.
  3. Content Ingestion: Publications are then made available on the Publica.la platform


X

Graph View