calculate_feature_statistics#

calculate_feature_statistics(features: list[str], data: DataFrame, ref_data: DataFrame) DataFrame[source]#

Perform two-sample Kolmogorov-Smirnov test for goodness of fit on features.

Parameters:
  • features – List of features to perform test on.

  • data – Sample data, with features as columns.

  • ref_data (pd.DataFrame) – References data, with features as columns.

Returns:

Kolmogorov-Smirnov statistics and p-values for each feature.