This repository contains Python scripts to convert CalSim3 output DSS files into CSV format and validate the conversion. It depends on the `pydsstools` library.
- Compatibility: The `pydsstools` library is compatible with 64-bit Python on Windows 10 and Ubuntu-like Linux distributions. For Linux, ensure that the `zlib`, `math`, `quadmath`, and `gfortran` libraries are installed. Using the Docker container in this repo will ensure this.
- Dependencies: This library depends on `heclib.a`, which must be installed correctly; this is why we are using Docker.
Included in the repository is a `Dockerfile` that builds a Linux image with the necessary libraries installed, fulfilling the compatibility and dependency requirements listed above. The instructions below are a guide to setting up a Docker container and running the Python scripts. If you are installing on Windows and don't want to use Docker, you can use the `Dockerfile` and the `pydsstools` README as a guide.
If you are on a Mac, you may encounter issues installing Docker. I had these issues, so I've included the `docker-fix` files in this repo. See: Docker for Mac Issue #7527. This issue has been resolved as of the start of 2025.
The repository includes Python files for exporting DSS files to CSV and for validating them. Note that the `pydsstools` library prints every path as it processes it. I haven't been able to suppress this.
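For orientation, here is a minimal sketch of the kind of export the converter performs, using the `pydsstools` `HecDss` API. The file names, the six-part wildcard, and the one-column-per-path CSV layout are illustrative assumptions, not a line-for-line description of `dss_to_csv.py`:

```python
import pandas as pd
from pydsstools.heclib.dss import HecDss

DSS_FILE = "scenarioA.dss"  # hypothetical input file

columns = {}
with HecDss.Open(DSS_FILE) as dss:
    # Match every record; pydsstools prints each path as it reads it.
    for path in dss.getPathnameList("/*/*/*/*/*/*/", sort=1):
        ts = dss.read_ts(path)  # read the full time series for this path
        # One CSV column per DSS path, indexed by timestamp (assumed layout).
        columns[path] = pd.Series(ts.values, index=ts.pytimes)

pd.DataFrame(columns).to_csv("scenarioA_L0.csv")
```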
- Install Docker Desktop: If necessary, download Docker from Docker's website. If you are on a Mac and get malware warnings, see the notes above. Start up the Docker app. You can test your installation by running:
> docker info
- Clone the Repository: Then `cd` into the directory.
- Edit `docker-compose.yml` if you want to change the data directory: The `docker-compose.yml` is set up with relative directories. See the directory documentation below. For at least your first run, I recommend using this directory structure as a trial.
- Build the Docker Image:
> cd pydsstools-docker
> docker-compose build
Building takes a few minutes. Once the image is built, you won't have to build it again.
- Run Services: See the `docker-compose` file for available services. The command structure is shown in the following examples.
To extract data from the CalSim DSS to CSV:
> docker-compose run <service> --dss /data/scenario/<filename>.dss --csv /data/scenario/<filename>.csv
Example:
> docker-compose run convert --dss /data/scenario/DCR2023_DV_9.3.1_v2a_Danube_Adj_v1.8.dss --csv /data/scenario/DCR2023_DV_9.3.1_v2a_Danube_Adj_v1.8.csv
You will see all the paths print on the console. This is a function of the `pydsstools` library. Once they have all printed, the console will hang for a bit. You have time for a cup of coffee; the process takes about 5 minutes to run.
To validate the resulting CSV against overlapping columns in a reference CSV (like the Trend Reports):
> docker-compose run <service> --ref /data/scenario/<reference>.csv --file /data/scenario/<output>.csv
Example:
> docker-compose run compare-csv --ref /data/scenario/s0011/coeqwal_s0011_adjBL_wTUCP_DV_v0.0.csv --file /data/scenario/s0011/s0011_output.csv
Any discrepancies will print to the console. Float tolerance is set to 1e-5; you can change this value in `compare_csv_files.py`. This test runs quickly.
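Conceptually, the comparison is an element-wise float check over the shared columns. The sketch below illustrates that idea and is not a copy of `compare_csv_files.py`; the file names are placeholders, and it assumes both files cover the same rows in the same order:

```python
import numpy as np
import pandas as pd

TOLERANCE = 1e-5  # matches the default described above

ref = pd.read_csv("reference.csv", index_col=0)     # hypothetical reference CSV
new = pd.read_csv("s0011_output.csv", index_col=0)  # hypothetical converted CSV

# Only columns present in both files can be compared.
for col in ref.columns.intersection(new.columns):
    a = ref[col].to_numpy(dtype=float)
    b = new[col].to_numpy(dtype=float)
    mismatches = ~np.isclose(a, b, atol=TOLERANCE, equal_nan=True)
    if mismatches.any():
        print(f"{col}: {mismatches.sum()} values differ by more than {TOLERANCE}")
```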
To validate the resulting CSV against the original DSS:
> docker-compose run <service> --dss /data/scenario/<filename>.dss --csv /data/scenario/<filename>.csv
Example:
> docker-compose run validate-sample --dss /data/scenario/s0011_adjBL_wTUCP/DSS/output/coeqwal_s0011_adjBL_wTUCP_DV_v0.0.dss --csv /data/scenario/s0011/s0011_output.csv
Any discrepancies will print to the console. `validate-sample` is a quick smoke test that validates the first 5 columns. `validate-all` validates every value in every column and takes about half an hour. Tolerance is 1e-10.
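Both validators follow the same pattern: re-read values from the DSS file and compare them to the CSV at the 1e-10 tolerance. Here is a minimal sketch of the smoke-test idea, assuming (as an illustration) that each CSV column header is the full DSS pathname it was exported from:

```python
import numpy as np
import pandas as pd
from pydsstools.heclib.dss import HecDss

TOLERANCE = 1e-10
SAMPLE_COLUMNS = 5  # the smoke test checks only the first few columns

csv = pd.read_csv("s0011_output.csv", index_col=0)  # hypothetical CSV export

with HecDss.Open("coeqwal_s0011.dss") as dss:       # hypothetical source DSS
    for path in csv.columns[:SAMPLE_COLUMNS]:
        ts = dss.read_ts(path)  # assumes the column header is a DSS pathname
        if not np.allclose(csv[path].to_numpy(dtype=float), ts.values,
                           atol=TOLERANCE, equal_nan=True):
            print(f"Mismatch in {path}")
```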
- Stop Services: When done, stop running services using:
> docker-compose down
Use Docker commands as normal. For example, to avoid voluminous print output, you can run Docker in the background with the `-d` flag. To automatically remove the container when it stops, use the `--rm` flag.
This repository also contains `csv_levels.py`, which creates a pipeline for different levels of CSV processing:
- Level 0: CSV format directly from DSS output
- Level 1: CSV with system and validation variables removed through Part C filtering
- Level 2: final variable CSV for the COEQWAL website
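DSS pathnames have six parts (`/A/B/C/D/E/F/`), and Part C identifies the variable type, which is what the Level-1 filter keys on. Here is a sketch of that step, assuming column headers are full pathnames and using made-up Part C names in the drop set:

```python
import pandas as pd

DROP_PARTS_C = {"JUNKC1", "JUNKC2"}  # hypothetical Part C values to remove

def part_c(pathname: str) -> str:
    # "/A/B/C/D/E/F/".split("/") -> ["", "A", "B", "C", "D", "E", "F", ""]
    return pathname.split("/")[3]

level0 = pd.read_csv("scenarioA_L0.csv", index_col=0)
keep = [col for col in level0.columns if part_c(col) not in DROP_PARTS_C]
level0[keep].to_csv("scenarioA_L1.csv")
```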
COEQWAL-pydsstools/
├── README.md
│
├── data/
│ ├── 00_dss/ # raw DSS output files
│ │ └── scenarioA.dss
│ ├── 10_level0_raw_csv/ # Level-0 CSVs (1-to-1 export)
│ │ └── scenarioA_L0.csv
│ ├── 20_level1_filtered/ # Level-1 CSVs (after dropping whole Part C's)
│ │ └── scenarioA_L1.csv
│ ├── 30_variable_maps/ # helper text files for manual review
│ │ ├── PartC.txt # unique Part-C list
│ │ └── PartsBC.txt # Part-C ➜ Part-B map
│ ├── 40_configs/ # YAML keep-lists used to create Level-2
│ │ └── scenarioA_keep.yml
│ └── 50_level2_final/ # Level-2, database-ready CSVs
│ └── scenarioA_L2.csv
│
└── pydsstools-docker/ # all runnable code + container build
├── python-code/
│ ├── dss_to_csv.py # Level-0 exporter
│ └── csv_levels.py # multi-mode helper (0,1,2,listC,mapBC)
├── Dockerfile
└── docker-compose.yml
Tip: Run these `docker compose` commands from inside the `pydsstools-docker/` folder. Inside the container, the project's top-level `data/` directory is mounted at `/data`, so the paths you see below will resolve automatically.
Level-0 export (DSS to raw CSV):
> docker compose run --rm convert \
--dss /data/00_dss/scenarioA.dss \
--csv /data/10_level0_raw_csv/scenarioA_L0.csv
List the unique Part C values for manual review:
> docker compose run --rm csv-levels listC \
/data/10_level0_raw_csv/scenarioA_L0.csv \
--outfile /data/30_variable_maps/PartC.txt
Level-1 filtering (drop entire Part C's):
> docker compose run --rm csv-levels 1 \
/data/10_level0_raw_csv/scenarioA_L0.csv \
/data/20_level1_filtered/scenarioA_L1.csv \
--drop JUNKC1 JUNKC2
Build the Part-C ➜ Part-B map:
> docker compose run --rm csv-levels mapBC \
/data/20_level1_filtered/scenarioA_L1.csv \
--mapfile /data/30_variable_maps/PartsBC.txt
Level-2 extraction with a YAML keep-list:
> docker compose run --rm csv-levels 2 \
/data/20_level1_filtered/scenarioA_L1.csv \
/data/50_level2_final/scenarioA_L2.csv \
--config /data/40_configs/scenarioA_keep.yml
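The Level-2 step reads a YAML keep-list and writes only the listed variables. The keep-list shape shown here (a plain list of column names under a `keep` key) is an assumption for illustration; check `csv_levels.py` for the actual schema:

```python
import pandas as pd
import yaml

# Hypothetical keep-list layout:
#   keep:
#     - scenario_variable_1
#     - scenario_variable_2
with open("scenarioA_keep.yml") as f:
    keep = yaml.safe_load(f)["keep"]

level1 = pd.read_csv("scenarioA_L1.csv", index_col=0)
level1[[c for c in keep if c in level1.columns]].to_csv("scenarioA_L2.csv")
```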