Randomly splitting data into a training set and a test set in Python

Date: 2015-06-23 19:54:36
# -*- coding: utf-8 -*-
"""
Created on Tue Jun 23 15:24:19 2015

@author: hd
"""

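# sklearn.cross_validation was removed in scikit-learn 0.20;
# train_test_split now lives in sklearn.model_selection.
from sklearn.model_selection import train_test_split

filename = r'C:\Users\hd\Desktop\bookmarks\bookmarks.arff'
train_path = r'C:\Users\hd\Desktop\bookmarks\train.arff'
test_path = r'C:\Users\hd\Desktop\bookmarks\test.arff'

# Read every line of the input file into a list.
with open(filename) as f:
    c = f.readlines()

# Shuffle the lines and split them at random.
# test_size=0.6 sends 60% of the lines to the test set and 40% to the training set.
c_train, c_test = train_test_split(c, test_size=0.6)

# Write each subset to its own file; "with" closes the files when done.
with open(train_path, 'w') as out_train:
    out_train.writelines(c_train)
with open(test_path, 'w') as out_test:
    out_test.writelines(c_test)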
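
One caveat: the script above shuffles every line of the file, so any ARFF header lines (`@relation`, `@attribute`, `@data`) get scattered across the two outputs along with the data rows. Below is a minimal sketch of one way to keep the header intact in both files. It is not from the original post: the helper name `split_arff_lines` is hypothetical, it assumes the data section starts right after a line that reads `@data`, and it uses the standard library `random` module instead of scikit-learn so that it runs without extra dependencies.

```python
import random


def split_arff_lines(lines, test_size=0.6, seed=None):
    """Split ARFF lines into (train, test), copying the header into both.

    Hypothetical helper for illustration. Assumes the data section begins
    after a line whose stripped, lower-cased text is exactly '@data'.
    If no such line exists, all lines are treated as data rows.
    """
    # Everything up to and including the '@data' marker is the header.
    data_start = 0
    for idx, line in enumerate(lines):
        if line.strip().lower() == '@data':
            data_start = idx + 1
            break

    header = lines[:data_start]
    # Keep only non-blank lines from the data section.
    rows = [line for line in lines[data_start:] if line.strip()]

    # Shuffle a private copy of the rows; a fixed seed makes the split repeatable.
    rng = random.Random(seed)
    rng.shuffle(rows)

    # The first n_test shuffled rows form the test set, the rest the training set.
    n_test = round(len(rows) * test_size)
    test_rows = rows[:n_test]
    train_rows = rows[n_test:]
    return header + train_rows, header + test_rows


# Small in-memory demo: a three-line header followed by ten data rows.
demo_lines = ['@relation demo\n', '@attribute x numeric\n', '@data\n']
demo_lines += ['%d\n' % i for i in range(10)]
demo_train, demo_test = split_arff_lines(demo_lines, test_size=0.6, seed=0)
print(len(demo_train), len(demo_test))
```

To apply it to the files from the post, read the input with `readlines()`, pass the list to the helper, and write the two returned lists with `writelines()`.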

  

Source: http://www.cnblogs.com/huadongw/p/4595949.html
