abacusai.page_data
==================

.. py:module:: abacusai.page_data


Classes
-------

.. autoapisummary::

   abacusai.page_data.PageData


Module Contents
---------------

.. py:class:: PageData(client, docId=None, page=None, height=None, width=None, pageCount=None, pageText=None, pageTokenStartOffset=None, tokenCount=None, tokens=None, extractedText=None, rotationAngle=None, pageMarkdown=None, embeddedText=None)

   Bases: :py:obj:`abacusai.return_class.AbstractApiClass`


   Data extracted from a docstore page.

   :param client: An authenticated API Client instance
   :type client: ApiClient
   :param docId: Unique Docstore string identifier for the document.
   :type docId: str
   :param page: The page number. Starts from 0.
   :type page: int
   :param height: The height of the page in pixels.
   :type height: int
   :param width: The width of the page in pixels.
   :type width: int
   :param pageCount: The total number of pages in document.
   :type pageCount: int
   :param pageText: The text extracted from the page.
   :type pageText: str
   :param pageTokenStartOffset: The offset of the first token in the page.
   :type pageTokenStartOffset: int
   :param tokenCount: The number of tokens in the page.
   :type tokenCount: int
   :param tokens: The tokens in the page.
   :type tokens: list
   :param extractedText: The extracted text in the page obtained from OCR.
   :type extractedText: str
   :param rotationAngle: The detected rotation angle of the page in degrees. Positive values indicate clockwise and negative values indicate anti-clockwise rotation from the original orientation.
   :type rotationAngle: float
   :param pageMarkdown: The markdown text for the page.
   :type pageMarkdown: str
   :param embeddedText: The embedded text in the page. Only available for digital documents.
   :type embeddedText: str


   .. py:attribute:: doc_id
      :value: None



   .. py:attribute:: page
      :value: None



   .. py:attribute:: height
      :value: None



   .. py:attribute:: width
      :value: None



   .. py:attribute:: page_count
      :value: None



   .. py:attribute:: page_text
      :value: None



   .. py:attribute:: page_token_start_offset
      :value: None



   .. py:attribute:: token_count
      :value: None



   .. py:attribute:: tokens
      :value: None



   .. py:attribute:: extracted_text
      :value: None



   .. py:attribute:: rotation_angle
      :value: None



   .. py:attribute:: page_markdown
      :value: None



   .. py:attribute:: embedded_text
      :value: None



   .. py:attribute:: deprecated_keys


   .. py:method:: __repr__()


   .. py:method:: to_dict()

      Get a dict representation of the parameters in this class

      :returns: The dict value representation of the class parameters
      :rtype: dict



