GAMCR in practice: A brief overview#

Typical File Structure for GAMCR#

GAMCR/
├── experiments/
│   ├── check_data.ipynb
│   ├── save_data_batch.py
│   ├── train_models.py
│   ├── compute_statistics.py
│   └── mysite/
│       ├── results/
│       │   ├── detailedresults/
│       │   ├── groups.csv
│       │   ├── NRF_RRD.csv
│       │   └── streamflow.csv
│       ├── data/
│       │   ├── transfer.npy
│       │   ├── lst_transfer.npy
│       │   ├── y_0.npy
│       │   └── ...
│       ├── data_mysite.txt
│       └── mysite_best_model.pkl

Folder Structure for Using the GAMCR Package#

To properly use the GAMCR package, you should have a folder for each site with the following structure:

You can name the folder as per your preference. For example: mysite.
Inside this folder, you should have a file named data_{mysite}.txt. This file can be created by running the notebook check_data.ipynb.
Two subfolders will be created and used by GAMCR:
- ``data/``: This subfolder will store the preprocessed data, created when running the script save_data_batch.py.
  - For simulated data:
    - If the true transfer functions have been computed, they are saved in a matrix in the file transfer.npy.
    - The indices corresponding to each precipitation event used to compute the true transfer functions are saved in the file lst_transfer.npy.
- ``results/``: This subfolder will store various statistics on the results of a trained model, created when running the script compute_statistics.py. Specifically, the following files will be generated:
  - groups.csv: Describes the precipitation and/or antecedent wetness ranges for each group of data points used to compute averages.
  - NRF_RRD.csv: Contains NRF, RRD, and weighted average RRD computed for the different groups of data points.
  - streamflow.csv: Includes observed and predicted streamflow time series.
GAMCR will also save the models you train for that site in this folder. By running the script train_models.py, a file named {mysite}_best_model.pkl will be generated.

Screen shot of an example of simulated data.