| 3,276 | 28 | 723 |
| 下载次数 | 被引频次 | 阅读次数 |
为了更加精准地预测二手房价格,该文以2019年深圳市二手房的真实交易数据为研究对象,利用线性回归模型、随机森林模型和XGBoost模型并加以POI计算来预测二手房价格.首先,对数据集进行清洗并可视化展示.其次,运用百度地图进行POI处理扩充数据集,使得数据集接近现实情况.接着,按照数据特征对房价影响的重要程度进行了排序,选取重要的特征来训练模型.最后,通过数值结果分析,XGBoost模型对二手房的房价评估效果最好,尤其是经过POI处理的数据集和XGBoost模型的这种组合,对于深圳市的二手房价格具有极好的预测效果.
Abstract:In order to predict the second-hand house prices more accurately, the real transaction data of second-hand houses in Shenzhen in 2019 is taken as the research object, using Linear Regression, Random Forest, XGBoost and add POI calculation to predict the price. Firstly, the dataset was cleaned and visualized. Then, the dataset was expanded by using Baidu map for POI processing to make it close to the reality. Next, the dataset features were ranked according to their importance in influencing house prices, and the important features were selected to train the models. Finally, the experimental data showed that XGBoost model was the best for predicting second-hand house prices, especially the combination of the POI dataset and XGBoost model had an excellent prediction performance for the second-hand house prices in Shenzhen.
[1] Limsombunc V,Gan C ,Lee M.House Price Prediction:Hedonic Price Model vs.Artificial Neural Network[C].Science Publications,2004.
[2] Antipov E A,Pokryshevskaya E B.Mass appraisal of residential apartments:An application of Random forest for valuation and a CART-based approach for model diagnostics[J].Expert Systems with Applications,2012,39(2):1772-1778.
[3] Fotheringham A S,Crespo R,Yao J.Exploring,modelling and predicting spatiotemporal variations in house prices[J].Annals of Regional Science,2015,54(2):417-436.
[4] Denisko D,Hoffman M M.Classification and interaction in random forests[J].Proceedings of the National Academy of Sciences of the United States of America,2018,115(8):1690-1692.
[5] Mohd T,Jamil N S,Johari N,et al.An Overview of Real Estate Modelling Techniques for House Price Prediction[M].Charting a Sustainable Future of ASEAN in Business and Social Sciences,2020.321-338.
[6] 陈世鹏,金升平.基于随机森林模型的房价预测[J].科技创新与应用,2016(04):52.
[7] 张靖苗.基于GWR模型的城市住宅价格空间分异及其影响因素对比研究——以昆明市和成都市为例[D].昆明:昆明理工大学,2017.
[8] 杨贵军,徐雪,赵富强.基于XGBoost算法的用户评分预测模型及应用[J].数据分析与知识发现,2019,3(01):118-126.
[9] 梁佩.基于机器学习的二手房估价模型[D].昆明:云南大学,2019.
[10] 龚洪亮.基于XGBoost算法的武汉市二手房价格预测模型的实证研究[D].武汉:华中师范大学,2018.
基本信息:
中图分类号:F299.23;TP181
引用信息:
[1]胡晓伟,马春梅,孔祥山,等.基于XGBoost的深圳二手房价格预测[J].曲阜师范大学学报(自然科学版),2022,48(01):57-65.
基金信息:
国家自然科学基金(62072273); 山东省重大基础研究(ZR201906140028)
2022-01-15
2022-01-15