基于Hadoop平台的用户行为挖掘 |
点此下载全文 |
引用本文:曾志浩,姚贝,张琼林,孙琪.基于Hadoop平台的用户行为挖掘[J].计算技术与自动化,2015,(2):100-103 |
摘要点击次数: 1279 |
全文下载次数: 53 |
|
|
中文摘要:随着互联网发展带来的数据爆炸,使得Web日志的数据量也越来越大,如何从海量的Web日志中挖掘有价值的信息成为了目前研究的热点。本文提出基于Hadoop集群框架对Web日志进行挖掘。实验结果表明,该集群系统既可以处理海量的web日志,同时也能够挖掘出有价值的信息,并证实了利用sqoop在Hive仓库和传统数据库之间数据迁移的可行性。 |
中文关键词:Web日志 Hadoop Sqoop Hive 数据迁移 |
|
User Behavior Mining Based on Hadoop platform |
|
|
Abstract:The rapid development of Internet brings data explosion, so web log data is becoming bigger and bigger. How to mine valuable information from huge amounts of Web log becomes the focus of present study. This paper presented Web log mining based on the Hadoop cluster framework. Experimental results show that this cluster system can process massive web log data, and can mine valuable information. And it is confirmed to be feasible that the data migrates between Hive warehouse and traditional database by using sqoop. |
keywords:Web log Hadoop Sqoop Hive data migration |
查看全文 查看/发表评论 下载pdf阅读器 |