1> tmp3
2# A tibble: 50 x 43 addr_state count default_count default_rate
451 AK 112420.37562 AL 5571940.34873 AR 3151220.38784 AZ 10903160.29095 CA 617318370.298106 CO 9522220.233117 CT 5631470.261128 DC 86140.163139 DE 118310.2631410 FL 30149330.31015# ... with 40 more rows16>17
Order States by Default Rate
We can order states by default rate to identify states with highest and lowest default rates.
We found some features are quite correlated, we can remove correlated features with findCorrelation function. The function will find all correlated pairs that have correlation exceed a specified threshold and try to remove one of them in such a way that overall correlation is reduced.