# Code structure

Here we describe the structure of the NNPDF code and give a high-level overview of its functionalities. The workflow for an NNPDF fit is displayed in the figure below.

Code structure diagram

## The APFELcomb interpolation table generator

This code takes hard-scattering partonic matrix element interpolators from APPLgrid and FastNLO (for hadronic processes) and APFEL (for DIS structure functions) and combines them with the DGLAP evolution kernels provided by APFEL to construct the fast interpolation grids called FK-tables (for further instructions see How to generate and implement FK tables). In this way, physical observables can be evaluated efficiently as the tensor contraction of an FK-table with a grid of PDFs at an initial parametrisation scale $Q_0$. APFELcomb also handles NNLO QCD and/or NLO electroweak K-factors when needed.
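The contraction idea can be illustrated with a minimal NumPy sketch. The array names, shapes, and random contents below are purely illustrative assumptions, not the actual FK-table format or NNPDF grid sizes:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (assumptions, not the real NNPDF grid sizes):
n_data, n_flav, n_x = 5, 14, 50   # data points, PDF flavours, x-grid nodes

# fk[d, a, i]: precomputed interpolation weights combining the hard
# cross section with DGLAP evolution from Q0 up to the scale of point d.
fk = rng.random((n_data, n_flav, n_x))

# pdf[a, i]: the PDF of flavour a at x-grid node i, at the scale Q0.
pdf = rng.random((n_flav, n_x))

# A DIS-like observable then reduces to a single tensor contraction:
predictions = np.einsum('dai,ai->d', fk, pdf)
print(predictions.shape)  # (5,)
```

Because the evolution kernels are folded into the FK-table once and for all, this contraction is all that has to be repeated during a fit.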

Theory predictions can be generated by configuring a variety of options, such as the perturbative order (currently up to NNLO), the values of the heavy quark masses, the electroweak parameters, the maximum number of active flavours, and the variable-flavour-number scheme used to account for the effects of the heavy quark masses in the DIS structure functions. The FK-tables resulting from each choice are associated with a database entry through a theory ID, which allows them to be identified quickly.

## The buildmaster experimental data formatter

This C++ code transforms the original measurements provided by the experimental collaborations, e.g. via HepData, into a standard format tailored for PDF fitting.

In particular, the code provides flexible handling of experimental systematic uncertainties, allowing for different treatments of the correlated systematics.
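One standard way correlated systematics enter a fit is through the experimental covariance matrix, where each fully correlated source contributes a rank-one term on top of the diagonal statistical errors. The following NumPy sketch shows this textbook construction with toy numbers; it is an illustration of the general technique, not buildmaster's actual code:

```python
import numpy as np

# Toy inputs (illustrative): three data points with a statistical
# uncertainty each, and two fully correlated systematic sources.
stat = np.array([1.0, 0.8, 1.2])   # statistical errors per point
beta = np.array([[0.5, 0.1],       # beta[i, k]: shift of data point i
                 [0.4, 0.2],       # induced by systematic source k
                 [0.6, 0.1]])

# C_ij = delta_ij * stat_i^2 + sum_k beta_ik * beta_jk
cov = np.diag(stat**2) + beta @ beta.T
print(cov)
```

Different treatments of a given source (e.g. additive versus multiplicative) amount to different prescriptions for how the `beta` shifts are computed before entering this sum.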

## The n3fit fitting code

This module implements the core fitting methodology, built on the TensorFlow framework. The n3fit library allows for a flexible specification of the neural network model adopted to parametrise the PDFs, whose settings can be selected automatically via the built-in hyperoptimization algorithm. These include the neural network type and architecture, the activation functions, and the initialization strategy; the choice of optimizer and of its corresponding parameters; and hyperparameters related to the implementation of theoretical constraints in the fit, such as PDF positivity and integrability.

The settings for a PDF fit are specified via a declarative runcard. Using these settings, n3fit determines the values of the neural-network parameters that define the PDF at the initial scale which best describes the input data. Following a post-fit selection (using the postfit tool implemented in validphys) and a PDF evolution step, the final output consists of an LHAPDF grid corresponding to the best-fit PDF, together with metadata on the fit performance.
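The parametrisation idea can be sketched as follows: a small dense network in $x$ multiplied by a preprocessing factor that enforces the expected small- and large-$x$ behaviour. This NumPy toy is a single-flavour illustration under assumed sizes and exponents; the real n3fit model is a multi-flavour TensorFlow/Keras network with fitted preprocessing:

```python
import numpy as np

rng = np.random.default_rng(1)

def nn_pdf(x, w1, b1, w2, b2, alpha=1.1, beta=3.0):
    """Toy single-flavour PDF: a 1 -> 8 -> 1 dense network times a
    preprocessing factor x**(1 - alpha) * (1 - x)**beta.  All sizes and
    exponent values here are illustrative assumptions."""
    h = np.tanh(np.outer(x, w1) + b1)   # hidden layer, tanh activation
    nn = h @ w2 + b2                    # linear output node
    return x**(1 - alpha) * (1 - x)**beta * nn.ravel()

# Randomly initialised parameters for the 1 -> 8 -> 1 network.
w1, b1 = rng.normal(size=8), rng.normal(size=8)
w2, b2 = rng.normal(size=8), rng.normal()

xgrid = np.geomspace(1e-5, 0.9, 7)      # log-spaced x-grid points
vals = nn_pdf(xgrid, w1, b1, w2, b2)
print(vals.shape)  # (7,)
```

In a fit, the free parameters (`w1`, `b1`, `w2`, `b2` here) would be optimised so that predictions contracted from FK-tables reproduce the data.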

## The validphys analysis framework

Built on the reportengine framework, validphys enables a workflow centred on declarative and reproducible runcards. The code implements data structures that can interact with those of libnnpdf and are accessible from the runcard. The analysis code makes heavy use of common Python data-science libraries such as NumPy, SciPy, Matplotlib and Pandas, and through its use of Pandoc it can output the final results as HTML reports. These can be composed directly by the user or be generated by more specialised, downstream applications. The package includes tools to interact with online resources such as fit results or PDF grids, which, for example, are automatically downloaded when a runcard requires them.
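The declarative-runcard idea can be sketched in miniature: the runcard names the results it wants, and providers (plain functions) declare their inputs as function arguments, which are resolved recursively. The resolver, provider, and runcard below are hypothetical illustrations of this style, not reportengine's actual API:

```python
import inspect

def resolve(name, runcard, providers, cache=None):
    """Hypothetical mini-resolver: look a resource up in the runcard,
    or compute it with a provider whose arguments are themselves
    resolved recursively."""
    cache = {} if cache is None else cache
    if name in cache:
        return cache[name]
    if name in runcard:                    # direct input from the runcard
        cache[name] = runcard[name]
    else:                                  # computed by a provider function
        fn = providers[name]
        args = {p: resolve(p, runcard, providers, cache)
                for p in inspect.signature(fn).parameters}
        cache[name] = fn(**args)
    return cache[name]

def chi2(data, theory):                    # toy provider
    return sum((d - t) ** 2 for d, t in zip(data, theory))

runcard = {"data": [1.0, 2.0], "theory": [1.1, 1.9]}
result = resolve("chi2", runcard, {"chi2": chi2})
print(result)
```

Because every result is a pure function of the runcard inputs, the same runcard always reproduces the same report, which is the property the framework's declarative design is built around.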