Data Mining Techniques in Analyzing Process
Methods
Participants
The U.S. sample (N = 429) was extracted
from the 2012 PISA public dataset. Students ranged in age from 15 years 3 months
to 16 years 2 months, representing 15-year-olds in the United States. Three
students with missing student IDs and school IDs were excluded, yielding a
sample of 426 students. There were no missing responses. The dataset
was randomly partitioned into a training dataset (n = 320, 75.12%) and a
test dataset (n = 106, 24.88%). The training dataset is usually made
about two to three times the size of the test dataset to increase
the precision of prediction.
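The random partition described above can be illustrated with a minimal sketch. The source does not state which software, procedure, or random seed was used, so the function name `split_train_test`, the seed value, and the placeholder student IDs below are assumptions made only for illustration; only the sample sizes (426, 320, 106) come from the text.

```python
import random


def split_train_test(ids, n_train, seed=0):
    """Randomly partition ids into a training list and a test list.

    Hypothetical helper: the seed and the shuffle-then-slice approach
    are assumptions, not the procedure reported in the study.
    """
    rng = random.Random(seed)   # seeded generator so the split is reproducible
    shuffled = list(ids)        # copy so the caller's list is left unchanged
    rng.shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


# Placeholder IDs standing in for the 426 retained students.
student_ids = list(range(1, 427))
train_ids, test_ids = split_train_test(student_ids, n_train=320)

n_total = len(student_ids)
train_share = 100 * len(train_ids) / n_total
test_share = 100 * len(test_ids) / n_total

print(f"train: n = {len(train_ids)} ({train_share:.2f}%)")
print(f"test:  n = {len(test_ids)} ({test_share:.2f}%)")
print(f"train/test size ratio: {len(train_ids) / len(test_ids):.2f}")
```

With these sizes the shares come out to 75.12% and 24.88%, and the training set is roughly three times the size of the test set, which is at the upper end of the two-to-three-times range mentioned above.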