mb.get_dataset(dataset_name, ...)
Fetches a dataset and returns it as a pandas DataFrame. Datasets can optionally be filtered and used as feature stores.
Parameters
dataset_name
:str
The name of the dataset.filters
:Optional[Dict[str, List[Any]]]
If supplied with afilters
dict, the DataFrame returned will be filtered to rows matching the filter criteria. The keys of thefilters
dict are column names. The values are lists of values to match against.
Returns
pandas.DataFrame
Examples
Get all rows in customer_features
similar_customers = mb.get_dataset("customer_features")
Get specific rows in customer_features
similar_customers = mb.get_dataset(
"customer_features",
filters={
"REGION": ["NA", "SA"]
"EMPLOYEE_COUNT": ["100-500","500-5000"]
}
)
See also
- Read the Datasets section of the docs for more info on using datasets as feature stores.