[python] kaggle, boston marathon

2021. 4. 29. 03:53Python/문법

chapter1. pandas다루기_마라톤.ipynb
0.18MB

Ver. Jupyter Notebook (Anaconda3)

▶ 데이터 합치기 : concat

>>> import pandas as pd
>>> import numpy as np

>>> marathon_2015 = pd.read_csv('./date_boston/marathon_results_2015.csv')
>>> marathon_2016 = pd.read_csv('./date_boston/marathon_results_2016.csv')
>>> marathon_2017 = pd.read_csv('./date_boston/marathon_results_2017.csv')


marathon_2015['Year'] = '2015'
marathon_2016['Year'] = '2016'
marathon_2017['Year'] = '2017'

# index 'Offical Time'으로 데이터 합치기
>>> marathon_2015_2017 = pd.concat([marathon_2015, marathon_2016, marathon_2017], ignore_index=True, sort=False)

# 불필요한 columns 삭제
>>> marathon_2015_2017 = marathon_2015_2017.drop(['Unnamed: 0', 'Bib', 'Citizen', 'Unnamed: 9', 'Proj Time', 'Unnamed: 8'], axis=1)
>>> marathon_2015_2017.head()

# for문, pandas로 초단위로 변환
>>> list(marathon_2015_2017.columns)[6:17]
>>> for i in list(marathon_2015_2017.columns)[6:17]:
           marathon_2015_2017[i] = pd.to_timedelta(marathon_2015_2017[i]).astype('m8[s]').astype(np.int64)

>>> print(marathon_2015_2017.shape)
>>> marathon_2015_2017.head()

# 데이터 저장
>>> marathon_2015_2017.to_csv('./data_boston/marathon_2015_2017.csv', index=None, header=True)

'Python > 문법' 카테고리의 다른 글

[python] crawler  (0) 2021.04.30
[Python] 데이터 시각화  (0) 2021.04.29
[python] 외부데이터  (0) 2021.04.29
[Python] 정리  (0) 2021.04.29
[python] pandas  (0) 2021.04.28