Big Data Analytics with Hadoop

贡献者:rurutia 类别:英文 时间:2016-03-02 17:03:27 收藏数:10 评分:0
返回上页 举报此文章
请选择举报理由:




收藏到我的文章 改错字
Big Data Analytics with Hadoop (3 credits)
In this course, students will learn cutting edge technologies and
concepts related to analysis of big data – data that is too large to process
in the main memory of one computer. The course is organized into two parts. In the first part,
students will build upon their understanding of RDBMS and SQL,
and explore the use of SQL-like queries in a big data environment
(Hadoop distributed file system – HDFS), using tools such as Sqoop, Pig and Hive.
Students will identify typical situations that warrant large data analysis,
move data between relational databases and Hadoop using Sqoop, manage data in HDFS,
and use Pig and Hive to run distributed queries on data.
In the second part of the course, students will build upon their understanding of
data mining techniques and learn to apply them to analyze large datasets.
Students will use Apache Mahout software in the Hadoop ecosystem to explore
item-based collaborative filtering, non-distributed recommenders, frequent itemset mining,
clustering, and some text mining algorithms, including Naïve Bayesian classifier.
Course includes: SQL-like querying in big data cluster; systems, classifiers,
clustering; Hadoop ecosystem overview; and deep dive into Hadoop Pig, Hive and Mahout.
声明:以上文章均为用户自行添加,仅供打字交流使用,不代表本站观点,本站不承担任何法律责任,特此声明!如果有侵犯到您的权利,请及时联系我们删除。
文章热度:
文章难度:
文章质量:
说明:系统根据文章的热度、难度、质量自动认证,已认证的文章将参与打字排名!

本文打字排名TOP20

登录后可见