Boosting Scoring Throughput

By default, the SCORE.sh script calculates all metrics (“scores”) for all plans in an input ensemble. When the ensemble is large, this can take a long time.

You can use a combination of two techniques to increase scoring throughput:

  1. “Shard” the ensemble of plans into files with fewer plans, e.g., divide it into 10 smaller files, and score the shards in parallel.
  2. Score independent categories of metrics—“general”, “partisan”, “minority”, “compactness”, and “splitting”—separately and in parallel.
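The first technique can be sketched in shell as follows. This is a minimal illustration, not one of the repository's scripts: the helper names shard_lines and score_shards_in_parallel are hypothetical, the sketch assumes the ensemble file holds one plan per line with no header line, and SCORER stands for any command or wrapper of your own that scores a single shard file.

```shell
# Hypothetical helpers for illustration only; not part of the repository.
# ASSUMPTION: the ensemble file holds one plan per line, with no header line.

# shard_lines INPUT NUM_SHARDS OUT_PREFIX
# Splits INPUT into at most NUM_SHARDS files named OUT_PREFIXaa, OUT_PREFIXab, ...
shard_lines() {
  local input="$1" num_shards="$2" out_prefix="$3"
  local total per_shard
  total=$(wc -l < "$input")
  # Ceiling division, so that no more than num_shards files are produced.
  per_shard=$(( (total + num_shards - 1) / num_shards ))
  if [ "$per_shard" -lt 1 ]; then per_shard=1; fi
  split -l "$per_shard" "$input" "$out_prefix"
}

# score_shards_in_parallel SCORER SHARD...
# Runs "SCORER SHARD" as a background job for each shard, then waits for all
# of them. Returns non-zero if any job failed.
score_shards_in_parallel() {
  local scorer="$1"; shift
  local shard pid pids="" status=0
  for shard in "$@"; do
    "$scorer" "$shard" &
    pids="$pids $!"
  done
  for pid in $pids; do
    wait "$pid" || status=1
  done
  return "$status"
}
```

For example, shard_lines plans.txt 10 shards/plans_ followed by score_shards_in_parallel my_scorer shards/plans_* would score up to ten shards concurrently, where plans.txt and my_scorer are placeholder names. Launching every shard at once is the simplest approach; on a machine with fewer cores than shards, you may want to cap the number of concurrent jobs.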

Combining the two techniques can substantially boost throughput.

When the two techniques are combined, the basic process is as follows:

There are three utility bash scripts in the scripts/throughput directory to support this:

The SCORE.sh script takes an optional --mode argument to specify which category of metrics to calculate. The default is to calculate all categories.
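As a rough sketch of the second technique, the function below launches one scoring run per category in the background and waits for all of them. It assumes that --mode accepts the category names listed above as its values, which this section does not confirm. The function name score_categories_in_parallel is hypothetical, and any other arguments SCORE.sh needs are passed through unchanged.

```shell
# Hypothetical helper for illustration only; not part of the repository.
# ASSUMPTION: --mode accepts these category names as values.
CATEGORIES="general partisan minority compactness splitting"

# score_categories_in_parallel SCORE_CMD [ARG...]
# Runs "SCORE_CMD ARG... --mode CATEGORY" once per category as a background
# job, then waits for all of them. Returns non-zero if any run failed.
score_categories_in_parallel() {
  local score_cmd="$1"; shift
  local category pid pids="" status=0
  for category in $CATEGORIES; do
    "$score_cmd" "$@" --mode "$category" &
    pids="$pids $!"
  done
  for pid in $pids; do
    wait "$pid" || status=1
  done
  return "$status"
}

# Usage (placeholder arguments):
#   score_categories_in_parallel ./SCORE.sh <your usual SCORE.sh arguments>
```

Because every run receives the same pass-through arguments, this sketch only makes sense if each mode writes its results to a distinct location; check how SCORE.sh names its outputs before relying on it.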