hydrotools.nwm_client.NWMFileProcessor module¶
NWM File Processor¶
Tools for processing NWM data in NetCDF (.nc) format.
Classes¶
NWMFileProcessor
- class hydrotools.nwm_client.NWMFileProcessor.NWMFileProcessor¶
Bases:
object
Provides a concrete interface for methods used to process National Water Model data from NetCDF (.nc) format to xarray.Dataset, dask.dataframe.Dataframe, or pandas.DataFrame.
- classmethod convert_to_dask_dataframe(ds: Dataset) DataFrame ¶
Generate a dask.dataframe.DataFrame from an xarray.Dataset.
- Parameters:
ds (xarray.Dataset, required) – xarray.Dataset containing National Water Model data.
- Return type:
dask.dataframe.DataFrame of NWM data.
- classmethod convert_to_dataframe(ds: Dataset) DataFrame ¶
Generate a pandas.DataFrame from an xarray.Dataset.
- Parameters:
ds (xarray.Dataset, required) – xarray.Dataset containing National Water Model data.
- Return type:
pandas.DataFrame of NWM data.
- classmethod get_dataset(paths: str | ~numpy._typing._array_like._SupportsArray[~numpy.dtype[~typing.Any]] | ~numpy._typing._nested_sequence._NestedSequence[~numpy._typing._array_like._SupportsArray[~numpy.dtype[~typing.Any]]] | bool | int | float | complex | bytes | ~numpy._typing._nested_sequence._NestedSequence[bool | int | float | complex | str | bytes], feature_id_filter: ~numpy._typing._array_like._SupportsArray[~numpy.dtype[~typing.Any]] | ~numpy._typing._nested_sequence._NestedSequence[~numpy._typing._array_like._SupportsArray[~numpy.dtype[~typing.Any]]] | bool | int | float | complex | str | bytes | ~numpy._typing._nested_sequence._NestedSequence[bool | int | float | complex | str | bytes] = Index([800010123, 800012331, 800005662, 800006277, 800009690, 800015240, 800016809, 800001894, 800005664, 800006488, ... 41028035, 41028026, 41025904, 15489152, 41022861, 41022723, 15465127, 15456882, 15476223, 15448784], dtype='int64', name='nwm_feature_id', length=8866), variables: ~typing.List[str] = ['streamflow']) Dataset ¶
Generate an xarray.Dataset from an input directory of NWM .nc files.
- Parameters:
paths (str or array-like of paths, required) – Glob string or array-like of paths passed directly to xarray.open_mfdataset
feature_id_filter (array-like, optional) – Subset of feature IDs to return. Defaults to USGS assimilation locations.
variables (list of str, optional, default ["streamflow"]) – List of variables to retrieve from source files. Options include: ‘streamflow’, ‘nudge’, ‘velocity’, ‘qSfcLatRunoff’, ‘qBucket’, ‘qBtmVertRunoff’
- Return type:
xarray.Dataset of paths lazily loaded.