Air Foil (UCI, Regression, n=1503, d=5)

Loading The Data

In [1]:
from kxy_datasets.uci_regressions import AirFoil # pip install kxy_datasets
In [2]:
dataset = AirFoil()
df = dataset.df # Retrieve the dataset as a pandas dataframe
y_column = dataset.y_column # The name of the column corresponding to the target
problem_type = dataset.problem_type # 'regression' or 'classification'
In [3]:
df.kxy.describe() # Visualize a summary of the data

-----------------------
Column: Angle of Attack
-----------------------
Type:   Continuous
Max:    22
p75:    9.9
Mean:   6.8
Median: 5.4
p25:    2.0
Min:    0.0

--------------------
Column: Chord Length
--------------------
Type:   Continuous
Max:    0.3
p75:    0.2
Mean:   0.1
Median: 0.1
p25:    0.1
Min:    0.0

------------------------------
Column: Displacement Thickness
------------------------------
Type:   Continuous
Max:    0.1
p75:    0.0
Mean:   0.0
Median: 0.0
p25:    0.0
Min:    0.0

-----------------
Column: Frequency
-----------------
Type:   Continuous
Max:    20,000
p75:    4,000
Mean:   2,886
Median: 1,600
p25:    800
Min:    200

----------------------
Column: Sound Pressure
----------------------
Type:   Continuous
Max:    140
p75:    129
Mean:   124
Median: 125
p25:    120
Min:    103

----------------
Column: Velocity
----------------
Type:   Continuous
Max:    71
p75:    71
Mean:   50
Median: 39
p25:    39
Min:    31

Data Valuation

In [4]:
df.kxy.data_valuation(y_column, problem_type=problem_type)
[====================================================================================================] 100% ETA: 0s    Duration: 0s
Out[4]:
Achievable R-Squared Achievable Log-Likelihood Per Sample Achievable RMSE
0 0.80 8.03e-01 3.09

Automatic (Model-Free) Variable Selection

In [5]:
df.kxy.variable_selection(y_column, problem_type=problem_type)
[====================================================================================================] 100% ETA: 0s    Duration: 0s
Out[5]:
Variable Running Achievable R-Squared Running Achievable RMSE
Selection Order
0 No Variable 0.00 6.90
1 Frequency 0.30 5.76
2 Displacement Thickness 0.73 3.61
3 Chord Length 0.75 3.43
4 Velocity 0.80 3.09
5 Angle of Attack 0.80 3.09