8. Pandas (35%)
Read CSV and JSON
讀取CSV檔
import pandas as pd
df = pd.read_csv('data/14196_drug_adv.csv', error_bad_lines=False)
set(df.刊播媒體類別)讀取Excel檔
import pandas as pd
df = pd.read_excel('../../R/clickbait_detection/labeled/tag_comparison_1st&2nd_round.xlsx', error_bad_lines=False)計數以了解資料概況
df.刊播媒體類別.value_counts()from collections import Counter
type_dict = Counter(df.刊播媒體類別)
print(type_dict)Pilot分析-群組化計數 group_by

案例分析:摘要youbike
載入資料

產生新的變項
觀察資料概況

Read R RDS
Read R RDA
Tokenizing post content

M1. Tokenize one column by for-loo
M2. Tokenize pandas columns by `apply()`
Applications: Building word2vec model
Last updated