Organisation of data files
The nnpdf++
code needs to be able to handle a great deal of different
options with regard to the treatment of both experimental data and theoretical
choices. In the code, every effort has been made to keep experimental and
theoretical parameters strictly separate.
In this section we shall specify the layout of the various nnpdf++
data
directory. It is in this directory that all of the read-only data to be used in
the fit are accessed. The data directory is located in the nnpdfcpp
git
repository, under the path /nnpdfcpp/data/
.
Experimental data storage
The central repository for CommonData
in use by nnpdf++
projects is
located in the nnpdfcpp
git repository at
/nnpdfcpp/data/commondata/
where a separate CommonData
file is stored for each Dataset with the
filename format
DATA_<SETNAME>.dat
Information on the treatment of systematic uncertainties, provided in
SYSTYPE
files, is located in the subdirectory
/nnpdfcpp/data/commondata/systypes
Here several SYSTYPE
files may be supplied for each Dataset. The
various options are enumerated by suffix to the filename. The filename format
for SYSTYPE
files is therefore
SYSTYPE_<SETNAME>_<SYSID>.dat
Where the default systematic ID is DEFAULT. As an example, consider
the first SYSTYPE
file for the D0ZRAP Dataset:
SYSTYPE_D0ZRAP_DEFAULT.dat
Theory lookup table
In order to organise the various different theoretical treatments available, a
lookup table is provided in sqlite3
format. This lookup table can be found
in the nnpdfcpp
repository data directory at:
/nnpdfcpp/data/theory.db
This file should only be edited in order to add new theory options. It may be
edited with any appropriate sqlite3
-supported software. A script is provided to
give a brief overview of the various theory options available. It can be found
at
/nnpdfcpp/data/disp_theory.py
and should be run without any arguments.
Theory options are enumerated by an integer TheoryID
. The parameters of
each theory option are described in the lookup table under the appropriate ID.
The current available parameters are summarised in Theory parameter definitions.
Theory storage
Each theory configuration is stored as a gzip compressed tar archive with filename format
theory_<THEORYID>.tgz
and is stored at the location specified in the default nnprofile.yaml
. For easy
access, they can be downloaded through the vp-get
utility. Each archive
contains the following directory structure
theory_X/
-cfactor/
-compound/
-fastkernel/
Inside the directory theory_X/cfactor/
are stored CFACTOR
files
with the filename format
CF_<TYP>_<SETNAME>.dat
where <TYP>
is a three-letter designation for the source of the C-factor
(e.g. EWK or QCD) and <SETNAME>
is the typical Dataset designation.
The directory theory_X/compound/
contains the COMPOUND
files
described earlier, this time with the filename format
FK_<SETNAME>-COMPOUND.dat
Finally the FK
tables themselves are stored in theory_X/fastkernel/
with the filename format
FK_<SETNAME>.dat
Naturally, all of the FastKernel and C-factor files within the directory
theory_X/
have been determined with the theoretical parameters specified in
the theory lookup table under ID X
.