Santander Product Recommendation
https://www.kaggle.com/c/santander-product-recommendation/kernels
Data 살펴보기 > Cleansing
https://www.kaggle.com/apryor6/detailed-cleaning-visualization
우선 엑셀에 데이터를 표로 구성하고, datatype을 정하고, 특징을 정리한다.

R
> d0.lag1
Month V1_lag1 V2_lag1 V3_lag1
1: 2 0 1 0
2: 3 0 1 1
3: 4 1 0 1
4: 5 1 0 1
5: 6 1 1 0
6: 7 0 1 0
7: 8 0 1 0
> d1
Month V1 V2 V3 V1_lag1 V2_lag1 V3_lag1
1: 1 0 1 0 NA NA NA
2: 2 0 1 1 0 1 0
3: 3 1 0 1 0 1 1
4: 4 1 0 1 1 0 1
5: 5 1 1 0 1 0 1
6: 6 0 1 0 1 1 0
7: 7 0 1 0 0 1 0
> d1
Month V1 V2 V3 V1_lag1 V2_lag1 V3_lag1 P1 P2 P3
1: 1 0 1 0 NA NA NA NA NA NA
2: 2 0 1 1 0 1 0 0 0 1
3: 3 1 0 1 0 1 1 1 -1 0
4: 4 1 0 1 1 0 1 0 0 0
5: 5 1 1 0 1 0 1 0 1 -1
6: 6 0 1 0 1 1 0 -1 0 0
7: 7 0 1 0 0 1 0 0 0 0