EADST

Train XGBoost Model with Pandas Input

Train XGBoost Model with Pandas Input

import warnings
warnings.filterwarnings("ignore")
import pandas as pd
import numpy as np
import xgboost as xgb
from sklearn.metrics import classification_report

train=pd.read_csv('./train.csv')
test=pd.read_csv('./test.csv')


info=pd.read_csv('info.csv')
print(info.head()) # column name
print(info.shape)
new_info = info.drop_duplicates(subset=['id']) # remove duplicate row with same id
train2=pd.merge(train, new_info[['id', 'number']], how='left', on='id').fillna(0) # merge table horizontally

train_y=train2['result']
train_x=train2.drop(columns=['uaid','result','others'])
test_id = test['id']
test_y=test['result']
test_x=test.drop(columns=['uaid','result','others'])


model = xgb.XGBClassifier()
model.fit(train_x, train_y)
train_predict_y = model.predict(train_x)
print(classification_report(train_y, train_predict_y))


result=model.predict_proba(test_x)
result=pd.concat([test_y,pd.DataFrame(result)],axis=1)
result.to_csv('./test_result.csv')
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
FastAPI Shortcut CV Food OpenAI Python 域名 Jupyter LLM Pickle CC Github Markdown Heatmap Knowledge Dataset uwsgi Plate Windows Permission 腾讯云 git XGBoost SPIE Interview ChatGPT 图标 Anaconda Paddle ResNet-50 HaggingFace Card Hotel Pandas Web Docker v0.dev 净利润 NLP CUDA TensorRT Bitcoin PIP Pillow Google Llama Translation 关于博主 Git Disk UI VGG-16 FP16 Proxy AI Excel UNIX GPTQ Template Vim NameSilo 报税 VSCode Linux Color Django WAN 算法题 torchinfo tar Input 飞书 Nginx CLAP 图形思考法 多进程 IndexTTS2 scipy Sklearn 版权 uWSGI Ptyhon JSON Tensor 公式 Video Logo tqdm GoogLeNet Crawler Tracking SAM 强化学习 Website transformers Conda PyCharm FP64 Land Bin TSV Quantize SQLite 顶会 Breakpoint LeetCode LaTeX CTC CAM DeepSeek Use TensorFlow PyTorch Bert Animate Domain Search icon Hungarian Quantization OCR Safetensors Transformers InvalidArgumentError DeepStream llama.cpp Diagram 阿里云 Magnet Algorithm PDF GGML Miniforge Augmentation diffusers YOLO Attention FP32 Tiktoken GIT BF16 Clash Claude Random printf Qwen2 TTS FP8 Image2Text 证件照 HuggingFace RAR v2ray Bipartite Numpy COCO XML NLTK 第一性原理 Jetson Password SVR EXCEL QWEN Qwen2.5 ModelScope Distillation Statistics API Streamlit Paper mmap Qwen CEIR Ubuntu Gemma Plotly BTC git-lfs CSV BeautifulSoup News Rebuttal Baidu SQL PDB Cloudreve WebCrawler RGB Datetime Mixtral Firewall Base64 多线程 Freesound Zip ONNX VPN Michelin Data Agent FlashAttention hf GPT4 LoRA OpenCV LLAMA 继承 财报 Review 搞笑 Hilton 递归学习法 云服务器 logger Math MD5 Pytorch 签证 音频 C++ Vmess
站点统计

本站现有博文323篇,共被浏览795444

本站已经建立2493天!

热门文章
文章归档
回到顶部