Skip to main content

mb.get_dataset(dataset_name, ...)

Fetches a dataset and returns it as a pandas DataFrame. Datasets can optionally be filtered and used as feature stores.

Parameters

  • dataset_name: str The name of the dataset.
  • filters: Optional[Dict[str, List[Any]]] If supplied with a filters dict, the DataFrame returned will be filtered to rows matching the filter criteria. The keys of the filters dict are column names. The values are lists of values to match against.

Returns

pandas.DataFrame

Examples

Get all rows in customer_features

similar_customers = mb.get_dataset("customer_features")

Get specific rows in customer_features

similar_customers = mb.get_dataset(
"customer_features",
filters={
"REGION": ["NA", "SA"]
"EMPLOYEE_COUNT": ["100-500","500-5000"]
}
)

See also