基于Hadoop平台的用户行为挖掘
    点此下载全文
引用本文:曾志浩,姚贝,张琼林,孙琪.基于Hadoop平台的用户行为挖掘[J].计算技术与自动化,2015,(2):100-103
摘要点击次数: 1279
全文下载次数: 53
作者单位
曾志浩,姚贝,张琼林,孙琪 (湖南工业大学 计算机与通信学院,湖南 株洲412000) 
中文摘要:随着互联网发展带来的数据爆炸,使得Web日志的数据量也越来越大,如何从海量的Web日志中挖掘有价值的信息成为了目前研究的热点。本文提出基于Hadoop集群框架对Web日志进行挖掘。实验结果表明,该集群系统既可以处理海量的web日志,同时也能够挖掘出有价值的信息,并证实了利用sqoop在Hive仓库和传统数据库之间数据迁移的可行性。
中文关键词:Web日志  Hadoop  Sqoop  Hive  数据迁移
 
User Behavior Mining Based on Hadoop platform
Abstract:The rapid development of Internet brings data explosion, so web log data is becoming bigger and bigger. How to mine valuable information from huge amounts of Web log becomes the focus of present study. This paper presented Web log mining based on the Hadoop cluster framework. Experimental results show that this cluster system can process massive web log data, and can mine valuable information. And it is confirmed to be feasible that the data migrates between Hive warehouse and traditional database by using sqoop.
keywords:Web log  Hadoop  Sqoop  Hive  data migration
查看全文   查看/发表评论   下载pdf阅读器