Home
Model
User space
Structure preprocessing
Data preprocessing
Modelling pipeline
Manual
Help
Contact
User Space
My Data
My Model
My Report
Structure Preprocessing
File check
Validation of molecules
Standardization of molecules
Custom Preprocessing
Data Preprocessing
Imputation of missing values
Removing low variance features
Removing high correlation features
Univariate feature selection
Tree-based feature selection
RFE feature selection
Modelling Process
Start Pipeline
Feature Calculation
Model Selection
Model Building
Prediction
Model interpretation
Feature importance
Tree visualization
Other tools
Statistical Analysis
Random training set split
Diverse training set split
What's the tool?
This tool can be used to split training set and test set by picking a subset of molecules randomly.
| Step 1: Upload data file
Choose your X_data file:
Example
Choose your y_data file:
Example
| Step 2: Set parameters
Set the test size for the data
If float, should be between 0.0 and 1.0 and represent the proportion of the dataset to include in the test split. If integer, represents the absolute number of test samples.
Set the random state for the data
Pseudo-random number generator state used for random sampling.