Breaking News: Grepper is joining You.com. Read the official announcement!

split data train, test by id python

Add Answer

Victorious Vendace answered on July 17, 2020 Popularity 9/10 Helpfulness 4/10

answer split data train, test by id python

related train-test split code in pandas

related split train and test data python

split data train, test by id python

Comment

Tip Victorious Vendace 1 GREPCC

train_inds, test_inds = next(GroupShuffleSplit(test_size=.20, n_splits=2, random_state = 7).split(df, groups=df['Group_Id']))

train = df.iloc[train_inds]
test = df.iloc[test_inds]

xxxxxxxxxx

train_inds, test_inds = next(GroupShuffleSplit(test_size=.20, n_splits=2, random_state = 7).split(df, groups=df['Group_Id']))

train = df.iloc[train_inds]

test = df.iloc[test_inds]

Popularity 9/10 Helpfulness 4/10 Language python

Source: stackoverflow.com

Tags: python

Link to this answer
Share Copy Link

Contributed on Jul 17 2020

Victorious Vendace

0 Answers Avg Quality 2/10

Closely Related Answers

train-test split code in pandas

Comment

Tip Coder_Fox 1 GREPCC

df_permutated = df.sample(frac=1)

train_size = 0.8
train_end = int(len(df_permutated)*train_size)

df_train = df_permutated[:train_end]
df_test = df_permutated[train_end:]

xxxxxxxxxx

df_permutated = df.sample(frac=1)

train_size = 0.8

train_end = int(len(df_permutated)*train_size)

df_train = df_permutated[:train_end]

df_test = df_permutated[train_end:]

Popularity 10/10 Helpfulness 4/10 Language python

Source: Grepper

Tags: pandas pa

Link to this answer
Share Copy Link

Contributed on Nov 18 2020

Coder_Fox

0 Answers Avg Quality 2/10

split train and test data python

Comment

Tip Innocent Iguana 1 GREPCC

import numpy as np
# Randomly take 80% index as mask
mask = np.random.rand(len(df)) < 0.8 
# Take features
df = df[['A','B','C','D']]
# Use index mask to pull out 80% training data
train_df = df[mask]
X_Train = train_df[['A','B','C']]
Y_Train = train_df['D']
# Use negation mask to pull out remaining testing data
test_df = df[~mask]
X_Test = test_df[['A','B','C']]
Y_Test = test_df['D']

xxxxxxxxxx

import numpy as np

# Randomly take 80% index as mask

mask = np.random.rand(len(df)) < 0.8

# Take features

df = df[['A','B','C','D']]

# Use index mask to pull out 80% training data

train_df = df[mask]

X_Train = train_df[['A','B','C']]

Y_Train = train_df['D']

# Use negation mask to pull out remaining testing data

test_df = df[~mask]

X_Test = test_df[['A','B','C']]

Y_Test = test_df['D']

Popularity 9/10 Helpfulness 2/10 Language python

Source: Grepper

Tags: python py

Link to this answer
Share Copy Link

Contributed on Feb 10 2023

Innocent Iguana

0 Answers Avg Quality 2/10

split data train, test by id python

Contents

More Related Answers

split data train, test by id python

Closely Related Answers

train-test split code in pandas

split train and test data python

Grepper

Documentation

Social

Legal

Contact

Oops, You will need to install Grepper and log-in to perform this action.