Kaggle -- Titanic - Machine Learning from Disaster

新手kaggle之旅:1 . 泰坦尼克号 

使用一个简单的决策树进行模型构建,达到75.8%的准确率(有点低,但是刚开始)

完整代码如下:

import pandas as pd
import numpy as np

df = pd.read_csv("train.csv")

df.info

label = ['Pclass','Sex','Age','SibSp','Fare','Embarked']

x = df[label]
y = df['Survived']
print(x.loc[0])

x['Embarked'] = x['Embarked'].map({'C': 1, 'Q': 2, 'S': 3})


x['Sex'] = x['Sex'].map({'male': 1,'female' : 2})
print(x.loc[0])

x = x.fillna(x.mean())


import sklearn
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

train_x,test_x,train_y,test_y = train_test_split(x,y,test_size=0.2,random_state=42,shuffle=True)

clf = DecisionTreeClassifier()
clf.fit(train_x,train_y)


y_pred = clf.predict(test_x)

accuracy = accuracy_score(y_pred,test_y)
print(f"Accuracy: {accuracy * 100:.2f}%")


res = pd.read_csv('test.csv')
print(res.loc[0])


res_x = res[label]
res_x['Embarked'] = res_x['Embarked'].map({'C': 1, 'Q': 2, 'S': 3})
res_x['Sex'] = res_x['Sex'].map({'male': 1,'female' : 2})
print(res_x.loc[0])

res_x = res_x.fillna(res_x.mean())


pred = clf.predict(res_x)
print(pred[0])

ans = res[['PassengerId']].copy()
ans['Survived'] = pred

print(ans.loc[0])

ans.to_csv("ans.csv")

相关推荐

  1. kaggle-abalone

    2024-06-10 16:24:02       27 阅读
  2. sklearn教程:titanic泰坦尼克号数据集

    2024-06-10 16:24:02       56 阅读

最近更新

  1. docker php8.1+nginx base 镜像 dockerfile 配置

    2024-06-10 16:24:02       98 阅读
  2. Could not load dynamic library ‘cudart64_100.dll‘

    2024-06-10 16:24:02       106 阅读
  3. 在Django里面运行非项目文件

    2024-06-10 16:24:02       87 阅读
  4. Python语言-面向对象

    2024-06-10 16:24:02       96 阅读

热门阅读

  1. 线程安全应用:

    2024-06-10 16:24:02       24 阅读
  2. 01-今日课程介绍

    2024-06-10 16:24:02       33 阅读
  3. 软件测试之黑盒测试与白盒测试

    2024-06-10 16:24:02       39 阅读
  4. 在WSL2的Ubuntu中安装和使用Docker/Podman

    2024-06-10 16:24:02       23 阅读
  5. [AIGC] 图论在LeetCode算法题中的应用

    2024-06-10 16:24:02       33 阅读
  6. 6_1 Linux 用户管理

    2024-06-10 16:24:02       29 阅读
  7. Migrate a WordPress database using MariaDB to another server

    2024-06-10 16:24:02       33 阅读
  8. Linux

    2024-06-10 16:24:02       36 阅读
  9. K8s 集群高可用master节点ETCD全部挂掉如何恢复?

    2024-06-10 16:24:02       35 阅读
  10. 大模型与知识图谱

    2024-06-10 16:24:02       34 阅读