IMPROVE Aerosol, Regional Haze Rule II (2003 Guidance)

Output file example (excerpt):

SiteCode|POC|FactDate|ParamCode|MethodID|Units|FactValue|HGroup|Status|ProviderStatus|good_year|PatchedOrSubbedFlag|n_dv|missing
ACAD1|1|2021-01-01|aerosol_bext|1010|1/Mm|12.24376|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|ammNO3f|1010|ug/m^3|0.1117|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ammNO3f_bext|1010|1/Mm|1.02711|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|ammNO3f_Large|1010|ug/m^3|0.00062|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ammNO3f_Small|1010|ug/m^3|0.11108|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ammSO4f|1010|ug/m^3|0.71016|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ammSO4f_bext|1010|1/Mm|6.08912|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|ammSO4f_Large|1010|ug/m^3|0.02522|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ammSO4f_Small|1010|ug/m^3|0.68494|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|CM_bext|1010|1/Mm|0.32673|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|CM_calculated|1010|ug/m^3|0.54455|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|dv|1010|dv|8.85574|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|ECf|1010|ug/m^3|0.1446|50|V0|NM|1|REG|118|0
ACAD1|1|2021-01-01|ECf_bext|1010|1/Mm|1.446|50|V0|---|1|REG|118|0
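
A minimal sketch of reading this pipe-delimited layout with Python's standard csv module is given here. The sample rows are copied from the excerpt above. The function name read_rows and the choice of which columns to convert to numbers are assumptions made for illustration, and the sketch assumes every numeric column is populated, which may not hold for every record in a full file.

```python
import csv
import io

# Header and three rows copied from the excerpt above.
# A real file would be opened with open(path, newline="") instead.
SAMPLE = """\
SiteCode|POC|FactDate|ParamCode|MethodID|Units|FactValue|HGroup|Status|ProviderStatus|good_year|PatchedOrSubbedFlag|n_dv|missing
ACAD1|1|2021-01-01|aerosol_bext|1010|1/Mm|12.24376|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|dv|1010|dv|8.85574|50|V0|---|1|REG|118|0
ACAD1|1|2021-01-01|ECf|1010|ug/m^3|0.1446|50|V0|NM|1|REG|118|0
"""

# Assumption: these columns always hold integers in the output file.
INT_COLUMNS = ("POC", "MethodID", "HGroup", "good_year", "n_dv", "missing")


def read_rows(text):
    """Parse pipe-delimited text into dicts, converting numeric columns."""
    rows = []
    for rec in csv.DictReader(io.StringIO(text), delimiter="|"):
        rec["FactValue"] = float(rec["FactValue"])
        for col in INT_COLUMNS:
            rec[col] = int(rec[col])
        rows.append(rec)
    return rows


rows = read_rows(SAMPLE)
# Index the day's values by parameter code for convenient lookup.
by_param = {r["ParamCode"]: r["FactValue"] for r in rows}
print(by_param["dv"], rows[0]["Units"])
```

Each site and date appears once per ParamCode, so keying a day's records by ParamCode is a convenient way to pick out a single parameter such as dv.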

Schema description:

Field|Description
SiteCode|XF_SiteCode
POC|XF_POC
FactDate|XF_FactDate
ParamCode|XF_ParamCode
MethodID|XF_MethodID
Units|XF_Units
FactValue|XF_FactValue
HGroup

HGroup ("Haze Group") indicates the quintile (percentile group) into which a data value falls when the year's data values are sorted from lowest to highest deciview (dv).

The IMPROVE convention bins the data values into "groups" based on this algorithm (where the prefix "p" stands for "percentile"):

IF the year's data did not meet the RHR completeness criteria or dv could not be calculated for some reason, THEN HGroup = 0
IF dv <= p20 THEN HGroup = 10
IF dv > p20 AND dv < p40 THEN HGroup = 30
IF dv >= p40 AND dv <= p60 THEN HGroup = 50
IF dv > p60 AND dv < p80 THEN HGroup = 70
IF dv >= p80 THEN HGroup = 90
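
The binning rules above can be sketched in Python as follows. This is an illustration, not the code used to produce the file: the function name hgroup and its parameters are hypothetical, and the percentile thresholds p20 through p80 are passed in as inputs because this page does not state how they are computed. The sketch also assumes p20 <= p40 <= p60 <= p80.

```python
def hgroup(dv, p20, p40, p60, p80, complete=True):
    """Return the haze group for one dv value.

    dv        -- deciview value, or None if it could not be calculated
    p20..p80  -- the year's dv percentile thresholds (supplied by the caller)
    complete  -- whether the year met the RHR completeness criteria
    """
    if not complete or dv is None:
        return 0
    if dv <= p20:
        return 10
    if dv < p40:   # here dv > p20 already holds
        return 30
    if dv <= p60:  # here dv >= p40 already holds
        return 50
    if dv < p80:   # here dv > p60 already holds
        return 70
    return 90      # dv >= p80


# Made-up thresholds, used only to exercise each branch.
thresholds = dict(p20=5.0, p40=8.0, p60=10.0, p80=14.0)
for value in (4.2, 6.0, 8.85574, 12.0, 14.0, None):
    print(value, hgroup(value, **thresholds))
```

Because each test is reached only after the previous one fails, the lower bound of each range is implied and only the upper bound needs checking. Values exactly equal to p40 and p60 fall in group 50, and a value equal to p80 falls in group 90, matching the inequalities listed above.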

Status

XF_StatusCode

Flag|Description|Start Date|End Date|Records
I1|Invalid value - known reason|2004-03-10|2004-11-17|85
M1|Missing value because no value is available|1988-03-02|2021-12-30|672,335
M2|Missing value because invalidated by data originator|1990-09-01|2021-12-30|43,344
M3|Missing value due to clogged filter|2000-01-01|2021-12-12|5,984
V0|Valid value|1988-03-02|2021-12-30|11,644,995
V2|Valid estimated value|2011-01-03|2013-02-09|366
V4|Valid value despite failing to meet some QC or statistical criteria|1988-03-23|2020-12-17|282,162
V5|Valid value but qualified because of possible contamination|1990-09-01|2017-11-06|32,415
V6|Valid value but qualified due to non-standard sampling conditions|2000-01-01|2021-12-30|378,640
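
As a small illustration of how the Status codes might be used when filtering records, the sketch below maps each code to its description and sorts codes into broad classes by their first letter. The grouping into "valid", "missing", and "invalid" is an interpretation of the descriptions in the table, not a rule stated by the data provider, and the names STATUS_DESCRIPTIONS and status_class are hypothetical.

```python
# Codes and descriptions copied from the Status table above.
STATUS_DESCRIPTIONS = {
    "I1": "Invalid value - known reason",
    "M1": "Missing value because no value is available",
    "M2": "Missing value because invalidated by data originator",
    "M3": "Missing value due to clogged filter",
    "V0": "Valid value",
    "V2": "Valid estimated value",
    "V4": "Valid value despite failing to meet some QC or statistical criteria",
    "V5": "Valid value but qualified because of possible contamination",
    "V6": "Valid value but qualified due to non-standard sampling conditions",
}

# Assumption: the first letter of a code identifies its broad class.
CLASS_BY_PREFIX = {"V": "valid", "M": "missing", "I": "invalid"}


def status_class(code):
    """Return 'valid', 'missing', 'invalid', or 'unknown' for a Status code."""
    if code not in STATUS_DESCRIPTIONS:
        return "unknown"
    return CLASS_BY_PREFIX[code[0]]


for code in ("V0", "V5", "M3", "I1", "X9"):
    print(code, status_class(code))
```

Whether qualified values such as V4, V5, and V6 should be kept depends on the analysis, so a stricter filter could test for specific codes instead of the whole "valid" class.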

ProviderStatus|The status flag/code assigned to the data value by the Air Quality Group at Crocker Nuclear Laboratory at the University of California, Davis
good_year|Indicates whether or not the data meet the RHR criteria for annual completeness
PatchedOrSubbedFlag

A code that indicates whether the data value is normal, patched, or substituted. The possible values are:

Flag|Description|Start Date|End Date|Records
PAT|The data value results from the RHR "patching" algorithm|1988-04-02|2021-12-30|37,186
REG|The data value results from a normal observation or calculation per the RHR guidance|1988-03-02|2021-12-30|12,623,159
SUB|The data value is a substituted value from another site or calculation procedure per the RHR guidance|2000-01-01|2018-12-19|38,252
---|The data value has not been assigned a specific flag in this regard|1988-03-23|2018-12-25|361,729

n_dv|The number of valid daily dv values for the site and year
missing|The number of missing values for the site and year

References: