Data Dictionary


Column Naming Convention

All combined output tables use a consistent prefix + suffix scheme.

Prefix — identifies what is included in the price:

| Prefix | Included components | Formula |
| --- | --- | --- |
| inst_ | Institutional / facility | inst_price |
| prof_ | Institutional + Professional Fee | inst_price + prof_professional_price |
| prof_with_optional_ | Institutional + Professional Fee + Optional Fee | inst_price + prof_all_fees_price |

prof_* prices are additive: the institutional rate is always included, and the professional fee component is added on top.
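
As a small illustration of the additive scheme, the sketch below builds the three prefixed commercial prices from their components. The function name and the sample dollar amounts are hypothetical, and the handling of missing components is not covered because the source does not describe it.

```python
def combined_prices(inst_price, professional_fee_price, optional_fee_price):
    """Build the three prefixed commercial prices from their components.

    Hypothetical helper: the institutional rate is always included, and the
    professional and optional fee components are layered on top of it.
    """
    prof_price = inst_price + professional_fee_price
    prof_with_optional_price = prof_price + optional_fee_price
    return {
        "inst_price": inst_price,
        "prof_price": prof_price,
        "prof_with_optional_price": prof_with_optional_price,
    }


# Made-up amounts, for illustration only.
example = combined_prices(4000.0, 600.0, 150.0)
print(example)
```

Because each prof_* price contains the institutional rate, it should never be summed with inst_price again downstream.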

Suffix — identifies the metric:

| Suffix | Description |
| --- | --- |
| _price | Volume-weighted average commercial canonical rate |
| _medicare_price | Medicare reference rate from the highest-volume code |
| _weight | price / base_rate (IP base = $6,000, OP base = $500) |
| _medicare_weight | medicare_price / base_rate |
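
The weight suffixes reduce to one division by a place-of-service base rate. The following is a minimal sketch, assuming the base rates given in the suffix descriptions (6,000 for IP and 500 for OP). BASE_RATES and to_weight are hypothetical names, not taken from prices.py, and passing a missing price through as None is an assumption.

```python
# Base rates by place of service, as listed in the suffix descriptions.
BASE_RATES = {"IP": 6000.0, "OP": 500.0}


def to_weight(price, pos):
    """Convert a price into a relative weight for the given place of service.

    Returns None when the price is missing (an assumption; the source does
    not say how missing prices are handled).
    """
    if price is None:
        return None
    return price / BASE_RATES[pos]


ip_weight = to_weight(12000.0, "IP")  # 12000 / 6000
op_weight = to_weight(750.0, "OP")    # 750 / 500
print(ip_weight, op_weight)
```

The same function would apply to commercial and Medicare prices alike, since both weight suffixes divide by the same base rate.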

combined_subcategory_fee_schedule_2026_03_12

Granularity: one row per (ssp_grouper, sub_category, pos, provider_id)

Location: tq_dev.internal_dev_csong_ssp

| Column | Type | Description |
| --- | --- | --- |
| ssp_grouper | string | SSP identifier |
| sub_category | string | Sub-category within the SSP (0 = base / least severe) |
| pos | string | Place of service: IP (inpatient) or OP (outpatient) |
| provider_id | string | Hospital provider identifier |
| inst_price | float | Institutional commercial rate |
| inst_medicare_price | float | Institutional Medicare rate |
| inst_weight | float | inst_price / base_rate |
| inst_medicare_weight | float | inst_medicare_price / base_rate |
| prof_price | float | inst_price + professional_fee_price (commercial) |
| prof_medicare_price | float | inst_medicare_price + professional_fee_medicare_price |
| prof_weight | float | prof_price / base_rate |
| prof_medicare_weight | float | prof_medicare_price / base_rate |
| prof_with_optional_price | float | inst_price + professional_fee_price + optional_fee_price (commercial) |
| prof_with_optional_medicare_price | float | inst_medicare_price + professional_fee_medicare_price + optional_fee_medicare_price |
| prof_with_optional_weight | float | prof_with_optional_price / base_rate |
| prof_with_optional_medicare_weight | float | prof_with_optional_medicare_price / base_rate |

combined_ssp_fee_schedule_2026_03_12

Granularity: one row per (ssp_grouper, pos, provider_id)

Location: tq_dev.internal_dev_csong_ssp

SSP-level prices are a volume-weighted average of the sub-category prices. Weights use the same base rates as the sub-category table.
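
The roll-up described above can be sketched as a volume-weighted average over sub-category rows. The function name, the (price, volume) input shape, and the choice to skip rows with a missing price are assumptions; the source does not state which volume column supplies the weights.

```python
def volume_weighted_price(rows):
    """Roll sub-category prices up to one SSP-level price.

    rows: iterable of (price, volume) pairs, one per sub-category.
    Rows with a missing price are skipped (an assumption).
    """
    usable = [(price, volume) for price, volume in rows if price is not None]
    total_volume = sum(volume for _, volume in usable)
    if total_volume == 0:
        return None
    return sum(price * volume for price, volume in usable) / total_volume


# Two hypothetical sub-categories for a single (ssp_grouper, pos, provider_id).
ssp_price = volume_weighted_price([(1000.0, 30), (2000.0, 10)])
print(ssp_price)
```

Dividing the resulting price by the place-of-service base rate would then give the SSP-level weight, consistent with the sub-category table.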

| Column | Type | Description |
| --- | --- | --- |
| ssp_grouper | string | SSP identifier |
| pos | string | Place of service: IP or OP |
| provider_id | string | Hospital provider identifier |
| total_claim_count | bigint | Sum of sub_package_total_billed_count across all base codes in the SSP (from sub_packages) |
| inst_price | float | Institutional commercial rate |
| inst_medicare_price | float | Institutional Medicare rate |
| inst_weight | float | inst_price / base_rate |
| inst_medicare_weight | float | inst_medicare_price / base_rate |
| prof_price | float | inst_price + professional_fee_price (commercial) |
| prof_medicare_price | float | inst_medicare_price + professional_fee_medicare_price |
| prof_weight | float | prof_price / base_rate |
| prof_medicare_weight | float | prof_medicare_price / base_rate |
| prof_with_optional_price | float | inst_price + professional_fee_price + optional_fee_price (commercial) |
| prof_with_optional_medicare_price | float | inst_medicare_price + professional_fee_medicare_price + optional_fee_medicare_price |
| prof_with_optional_weight | float | prof_with_optional_price / base_rate |
| prof_with_optional_medicare_weight | float | prof_with_optional_medicare_price / base_rate |

Combo SSPs

Combo SSPs (e.g., GA.2.colonoscopy_and_egd) are stored in the same combined output tables above. They are created by combo_ssps.py using multiple-procedure logic: 100% of the primary procedure (the one with the higher inst_price) plus 50% of the secondary procedure (the one with the lower inst_price), with primary and secondary determined separately for each provider. Each combo SSP has a single sub-category, sub_category = '0', in combined_subcategory_fee_schedule.

See Methodology — Step 8 for details.
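
The multiple-procedure rule can be illustrated with a short sketch. This is not the code in combo_ssps.py; the function name, provider identifiers, and prices are hypothetical, and only the inst_price case is shown because the source does not spell out how the other price columns are combined.

```python
def combo_inst_price(price_a, price_b):
    """Apply multiple-procedure logic to two component institutional prices.

    The higher-priced component is treated as primary (paid at 100%) and the
    lower-priced component as secondary (paid at 50%).
    """
    primary = max(price_a, price_b)
    secondary = min(price_a, price_b)
    return primary + 0.5 * secondary


# Component inst_price values per provider (made-up numbers). The primary
# component can differ between providers, so the rule is applied per provider.
component_prices = {
    "prov_1": (1200.0, 800.0),  # first component is primary
    "prov_2": (700.0, 900.0),   # second component is primary
}
combo_prices = {
    provider_id: combo_inst_price(a, b)
    for provider_id, (a, b) in component_prices.items()
}
print(combo_prices)
```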


Source Tables

manual_institutional_line_codes

Granularity: one row per (base_code, line_code)

Location: tq_dev.internal_dev_csong_ssp

Claims-derived institutional line codes not in sub_package_contents (Facility Fee). Produced by manual_institutional_line_codes.py. Includes hardcoded combo SSP entries.

| Column | Description |
| --- | --- |
| base_code | Anchor billing code (or combo base code, e.g., 43250 + 45384) |
| line_code | Co-occurring procedure code |

manual_professional_line_codes

Granularity: one row per (base_code, line_code)

Location: tq_dev.internal_dev_csong_ssp

Claims-derived professional line codes not in sub_package_contents (Professional/Optional Fee). Produced by manual_professional_line_codes.py. Also mirrors ancillary codes (anesthesia, radiology, lab/path) from the institutional side and includes hardcoded combo SSP entries.

| Column | Description |
| --- | --- |
| base_code | Anchor billing code (or combo base code, e.g., 43250 + 45384) |
| line_code | Co-occurring procedure code |

institutional_line_codes

Granularity: one row per (ssp_grouper, sub_category, pos, line_code, provider_id)

Location: tq_dev.internal_dev_csong_ssp

Line-code-level detail for the institutional fee schedule. Combines sub_package_contents (Facility Fee), manual_institutional_line_codes, and DRG anchor codes. Produced by institutional_line_codes.py.

| Column | Description |
| --- | --- |
| ssp_grouper | SSP identifier |
| sub_category | Sub-category (0 = least severe) |
| pos | IP or OP |
| line_code | Procedure code or revenue code |
| label | Anchor Code, Revenue Code, Carved Out: Drug, Carved Out: Implant, or NULL |
| fee_type | Facility Fee |
| provider_id | Hospital provider identifier |
| canonical_rate | Commercial rate (anchor codes only) |
| medicare_rate | Medicare rate (anchor codes only) |
| medicare_code | MS-DRG or APC code used for Medicare pricing |
| rate_source | validated or backup |

institutional_fee_schedule_2026_03_12

Granularity: one row per (ssp_grouper, sub_category, pos, provider_id)

| Column | Description |
| --- | --- |
| ssp_grouper | SSP identifier |
| sub_category | Sub-category (0 = least severe) |
| pos | IP or OP |
| code | Highest-volume billing code (MS-DRG or HCPCS) in the sub-category |
| multiplier | RII tier multiplier (1.0 for non-tiered sub-categories) |
| provider_id | Hospital provider identifier |
| subcategory_price | Volume-weighted avg commercial rate at sub-category level, scaled by multiplier |
| subcategory_medicare_price | Volume-weighted avg Medicare rate at sub-category level |
| ssp_grouper_price | Commercial rate rolled up to SSP level |
| ssp_grouper_medicare_price | Medicare rate rolled up to SSP level |
| subcategory_weight | subcategory_price / base_rate |
| subcategory_medicare_weight | subcategory_medicare_price / base_rate |
| ssp_grouper_weight | ssp_grouper_price / base_rate |
| ssp_grouper_medicare_weight | ssp_grouper_medicare_price / base_rate |

professional_line_codes_2026_03_27

Granularity: one row per (ssp_grouper, sub_category, pos, base_code, line_code, fee_type, provider_id)

Location: tq_dev.internal_dev_csong_ssp

Intermediate table preserving line-code level detail. Produced by professional_line_codes.py. Only includes line codes where association_rate IS NULL OR association_rate > 0.3.
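
The inclusion rule above can be expressed as a simple predicate. This is a sketch only: the constant and function names are hypothetical, and None stands in for SQL NULL.

```python
ASSOCIATION_THRESHOLD = 0.3  # from the filter: association_rate > 0.3


def keep_line_code(association_rate):
    """Return True when a line code passes the inclusion filter.

    A missing association_rate (None, i.e. SQL NULL) is kept; otherwise the
    rate must be strictly greater than the threshold.
    """
    return association_rate is None or association_rate > ASSOCIATION_THRESHOLD


# Hypothetical association rates, for illustration only.
candidates = [None, 0.05, 0.3, 0.31, 0.9]
kept = [rate for rate in candidates if keep_line_code(rate)]
print(kept)
```

Note that the comparison is strict, so a line code with an association_rate of exactly 0.3 is excluded.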

| Column | Description |
| --- | --- |
| ssp_grouper | SSP identifier |
| sub_category | Sub-category (0 = least severe) |
| pos | IP or OP |
| base_code | Anchor billing code (HCPCS/CPT) |
| line_code | Professional/optional fee line code |
| fee_type | Professional Fee or Optional Fee |
| provider_id | Hospital provider identifier |
| units | Unit multiplier applied to rates: anesthesia codes scaled by average_units / 15; anchor codes in the same SSP capped at 1; all others = 1 |
| canonical_rate | COALESCE(validated, backup) commercial rate, multiplied by units |
| canonical_rate_validated | Validated commercial rate (score = 5) |
| canonical_rate_backup | Backup commercial rate (score greater than 1) |
| n_rates_validated | Number of validated rate observations |
| n_rates_backup | Number of backup rate observations |
| volume | Claim volume |
| medicare_rate | Medicare rate multiplied by units: MPFS state-level rate first, then anesthesia fee schedule, then CLFS national rate as final fallback |
| rate_source | validated or backup |
| association_rate | Relative encounter proportion from claims discovery (NULL for codes from sub_package_contents with no claims data) |
| service_type | Anesthesia, Lab/Path, Radiology, or Professional (from ssp_line_code_service_types) |
| line_code_description | Human-readable description from services_spines_cleaned |
| line_code_shorthand | Short label from services_spines_cleaned |
| ccs_category | CCS (Clinical Classification Software) category from services_spines_cleaned |
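
The units, canonical_rate, and medicare_rate descriptions above combine a unit multiplier with a first-available fallback. The sketch below shows one way to read them. All function and parameter names are hypothetical, identifying anesthesia codes by service_type is an assumption, and the "capped at 1" rule for anchor codes is simplified to a multiplier of 1.

```python
ANESTHESIA_UNIT_DIVISOR = 15  # from the units description: average_units / 15


def units_for(service_type, average_units=None):
    """Return the unit multiplier; only anesthesia codes are scaled here."""
    if service_type == "Anesthesia" and average_units is not None:
        return average_units / ANESTHESIA_UNIT_DIVISOR
    return 1.0


def first_available(*rates):
    """Return the first rate that is not None, mimicking SQL COALESCE."""
    for rate in rates:
        if rate is not None:
            return rate
    return None


def canonical_rate(validated, backup, units):
    """Return (commercial rate scaled by units, rate_source)."""
    base = first_available(validated, backup)
    if base is None:
        return None, None
    source = "validated" if validated is not None else "backup"
    return base * units, source


def medicare_rate(mpfs_state, anesthesia_fs, clfs_national, units):
    """Fallback order: MPFS state rate, anesthesia fee schedule, CLFS national."""
    base = first_available(mpfs_state, anesthesia_fs, clfs_national)
    return None if base is None else base * units


# Made-up inputs: an anesthesia code with no validated commercial rate.
units = units_for("Anesthesia", average_units=45)
rate, source = canonical_rate(None, 100.0, units)
print(units, rate, source)
```
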

professional_fee_schedule_2026_03_12

Granularity: one row per (ssp_grouper, sub_category, pos, provider_id)

Location: tq_dev.internal_dev_csong_ssp

Aggregated from the professional_line_codes table. It has the same column structure as the institutional fee schedule table (prices only, no weights), plus price variants restricted to Professional Fee codes:

| Extra columns | Description |
| --- | --- |
| subcategory_professional_price | Commercial rate, Professional Fee codes only |
| subcategory_professional_medicare_price | Medicare rate, Professional Fee codes only |
| ssp_grouper_professional_price | Rolled-up commercial rate, Professional Fee only |
| ssp_grouper_professional_medicare_price | Rolled-up Medicare rate, Professional Fee only |

Weights for all columns (institutional and professional) are computed in prices.py.