Metadata-Version: 2.4
Name: coffea
Version: 2025.5.0rc1
Summary: Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP analysis.
Project-URL: Homepage, https://github.com/coffeateam/coffea
Project-URL: Bug Tracker, https://github.com/coffeateam/coffea/issues
Author-email: Lindsey Gray <lagray@fnal.gov>, Nick Smith <ncsmith@fnal.gov>
Maintainer-email: Lindsey Gray <lagray@fnal.gov>, Nick Smith <ncsmith@fnal.gov>
License: BSD-3-Clause
License-File: LICENSE
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: MacOS
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Scientific/Engineering :: Mathematics
Classifier: Topic :: Scientific/Engineering :: Physics
Classifier: Topic :: Software Development
Classifier: Topic :: Utilities
Requires-Python: >=3.9
Requires-Dist: aiohttp
Requires-Dist: awkward>=2.8.2
Requires-Dist: cachetools
Requires-Dist: cloudpickle>=1.2.3
Requires-Dist: correctionlib>=2.6.0
Requires-Dist: dask-awkward>=2025.5.0
Requires-Dist: dask-histogram>=2025.2.0
Requires-Dist: dask[array]>=2024.3.0
Requires-Dist: fsspec-xrootd>=0.2.3
Requires-Dist: hist>=2
Requires-Dist: lz4
Requires-Dist: matplotlib>=3
Requires-Dist: mplhep>=0.1.18
Requires-Dist: numba>=0.58.1
Requires-Dist: numpy>=1.22
Requires-Dist: packaging
Requires-Dist: pandas
Requires-Dist: pyarrow>=6.0.0
Requires-Dist: requests
Requires-Dist: scipy>=1.1.0
Requires-Dist: toml>=0.10.2
Requires-Dist: tqdm>=4.27.0
Requires-Dist: uproot>=5.6.0
Requires-Dist: vector!=1.6.0,>=1.4.1
Provides-Extra: dask
Requires-Dist: bokeh!=3.0.*,>=2.4.2; extra == 'dask'
Requires-Dist: distributed>=2024.3.0; extra == 'dask'
Provides-Extra: dev
Requires-Dist: black; extra == 'dev'
Requires-Dist: distributed>=2023.4.0; extra == 'dev'
Requires-Dist: flake8; extra == 'dev'
Requires-Dist: ipython; extra == 'dev'
Requires-Dist: nbsphinx; extra == 'dev'
Requires-Dist: pre-commit; extra == 'dev'
Requires-Dist: pyinstrument; extra == 'dev'
Requires-Dist: pytest; extra == 'dev'
Requires-Dist: pytest-asyncio; extra == 'dev'
Requires-Dist: pytest-cov; extra == 'dev'
Requires-Dist: pytest-mock; extra == 'dev'
Requires-Dist: pytest-mpl; extra == 'dev'
Requires-Dist: sphinx-automodapi; extra == 'dev'
Requires-Dist: sphinx-copybutton>=0.3.2; extra == 'dev'
Requires-Dist: sphinx-rtd-theme; extra == 'dev'
Requires-Dist: sphinx<8; extra == 'dev'
Provides-Extra: parsl
Requires-Dist: parsl>=2024.12.09; extra == 'parsl'
Provides-Extra: rucio
Requires-Dist: rucio-clients>=32; extra == 'rucio'
Provides-Extra: spark
Requires-Dist: ipywidgets; extra == 'spark'
Requires-Dist: jinja2; extra == 'spark'
Requires-Dist: pyspark>=3.3.0; extra == 'spark'
Description-Content-Type: text/x-rst

.. image:: docs/source/logo/coffea_logo.svg
    :align: center
    :width: 250px
    :alt: logo


coffea - Columnar Object Framework For Effective Analysis
=========================================================

.. image:: https://zenodo.org/badge/159673139.svg
   :target: https://zenodo.org/badge/latestdoi/159673139

.. image:: https://github.com/scikit-hep/coffea/actions/workflows/ci.yml/badge.svg
    :target: https://github.com/scikit-hep/coffea/actions?query=workflow%3ACI%2FCD+event%3Aschedule+branch%3Amaster

.. image:: https://codecov.io/gh/scikit-hep/coffea/branch/master/graph/badge.svg?event=schedule
    :target: https://codecov.io/gh/scikit-hep/coffea

.. image:: https://badge.fury.io/py/coffea.svg
    :target: https://badge.fury.io/py/coffea

.. image:: https://img.shields.io/pypi/dm/coffea.svg
    :target: https://img.shields.io/pypi/dm/coffea

.. image:: https://img.shields.io/conda/vn/conda-forge/coffea.svg
    :target: https://anaconda.org/conda-forge/coffea

.. image:: https://badges.gitter.im/scikit-hep/coffea.svg
    :target: https://matrix.to/#/#coffea-hep_community:gitter.im

.. image:: https://mybinder.org/badge_logo.svg
   :target: https://mybinder.org/v2/gh/scikit-hep/coffea/master?filepath=binder/

.. inclusion-marker-1-do-not-remove

Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP analysis.

.. inclusion-marker-1-5-do-not-remove

coffea is a prototype package for pulling together all the typical needs
of a high-energy collider physics (HEP) experiment analysis using the scientific
python ecosystem. It makes use of `uproot <https://github.com/scikit-hep/uproot4>`_
and `awkward-array <https://github.com/scikit-hep/awkward-1.0>`_ to provide an
array-based syntax for manipulating HEP event data in an efficient and numpythonic
way. There are sub-packages that implement histogramming, plotting, and look-up
table functionalities that are needed to convey scientific insight, apply transformations
to data, and correct for discrepancies in Monte Carlo simulations compared to data.

coffea also supplies facilities for horizontally scaling an analysis in order to reduce
time-to-insight in a way that is largely independent of the resource the analysis
is being executed on. By making use of modern *big-data* technologies like
`Apache Spark <https://spark.apache.org/>`_,  `parsl <https://github.com/Parsl/parsl>`_,
`Dask <https://dask.org>`_ , and `Work Queue <http://ccl.cse.nd.edu/software/workqueue>`_,
it is possible with coffea to scale a HEP analysis from a testing
on a laptop to: a large multi-core server, computing clusters, and super-computers without
the need to alter or otherwise adapt the analysis code itself.

coffea is a HEP community project collaborating with `iris-hep <http://iris-hep.org/>`_
and is currently a prototype. We welcome input to improve its quality as we progress towards
a sensible refactorization into the scientific python ecosystem and a first release. Please
feel free to contribute at our `github repo <https://github.com/scikit-hep/coffea>`_!

.. inclusion-marker-2-do-not-remove

Installation
============

Install coffea like any other Python package:

.. code-block:: bash

    pip install coffea

or similar (use ``sudo``, ``--user``, ``virtualenv``, or pip-in-conda if you wish).
For more details, see the `Installing coffea <https://coffea-hep.readthedocs.io/en/latest/installation.html>`_ section of the documentation.

Strict dependencies
===================

- `Python <http://docs.python-guide.org/en/latest/starting/installation/>`__ (3.9+)

The following are installed automatically when you install coffea with pip:

- `numpy <https://scipy.org/install.html>`__ (1.22+);
- `uproot <https://github.com/scikit-hep/uproot5>`__ for interacting with ROOT files and handling their data transparently;
- `awkward-array <https://github.com/scikit-hep/awkward>`__ to manipulate complex-structured columnar data, such as jagged arrays;
- `numba <https://numba.pydata.org/>`__ just-in-time compilation of python functions;
- `scipy <https://scipy.org/scipylib/index.html>`__ for many statistical functions;
- `matplotlib <https://matplotlib.org/>`__ as a plotting backend;
- and other utility packages, as enumerated in ``pyproject.toml``.

.. inclusion-marker-3-do-not-remove

Documentation
=============
All documentation is hosted at https://coffea-hep.readthedocs.io/

Citation
========
If you would like to cite this code in your work, you can use the zenodo DOI indicated in ``CITATION.cff``, or the `latest DOI <https://zenodo.org/badge/latestdoi/159673139>`__. You may also cite the proceedings:

- "N. Smith et al 2020 EPJ Web Conf. 245 06012"
- "L. Gray et al 2023 J. Phys.: Conf. Ser. 2438 012033"
