Apache Hadoop

Apache Hadoop is a freely distributed set of utilities, libraries, and a framework for developing and running distributed programs on clusters of hundreds or thousands of nodes.
It is used to implement the search and contextual mechanisms of many high-load websites, including Yahoo! and Facebook.
It is written in Java and built around the MapReduce computing paradigm, in which an application is split into a large number of identical elementary tasks that run on the cluster's nodes and are naturally combined into a final result.
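The MapReduce idea described above can be illustrated with a minimal single-process sketch. This is not Hadoop's API: the function names (`map_task`, `shuffle`, `reduce_task`, `word_count`) and the sample input are hypothetical, chosen only to show how identical elementary tasks produce partial results that are then combined. On a real cluster each chunk would be processed on a separate node.

```python
from collections import defaultdict


def map_task(chunk):
    """Elementary task: emit a (word, 1) pair for every word in one chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group all emitted values by key, as happens between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_task(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


def word_count(chunks):
    # Every chunk is handled by the same map function (here sequentially;
    # in a cluster, these calls would run in parallel on different nodes).
    mapped = [pair for chunk in chunks for pair in map_task(chunk)]
    grouped = shuffle(mapped)
    return dict(reduce_task(key, values) for key, values in grouped.items())


# Hypothetical input: two chunks standing in for blocks of a large file.
chunks = [
    "Hadoop runs MapReduce jobs",
    "MapReduce jobs run on Hadoop clusters",
]
counts = word_count(chunks)
print(counts)
```

Because each map task depends only on its own chunk, the work can be spread over any number of nodes, and the reduce step merges the partial results by key.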
H
Manning Publications, 2021. — 482 p. — ISBN 978-1617296901. Data Pipelines with Apache Airflow teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for...
  • No. 1
  • 9.14 MB
O’Reilly, 2017. — 300 p. — ISBN: 978-1491959633. Up until recently, Hadoop deployments have existed on hardware owned and run by organizations, often alongside legacy “big-iron” hardware. Today, cloud service providers allow customers to effectively rent hardware and associated network connectivity, along with a variety of other features like databases and bulk storage. But...
  • No. 2
  • 4.93 MB
K
PE Press, 2021. — 120 p. — ISBN: 978-1-716-10839-6. This book provides an alternative approach to getting started with Big Data queries using Apache Impala. It describes how to work with Apache Impala and how to run queries in it. Apache Impala is a modern, open source, distributed SQL query engine for Apache Hadoop. With Impala, we can query data, whether stored...
  • No. 3
  • 4.60 MB
M
Addison-Wesley Professional, 2016. — 387 p. — (Addison-Wesley Data & Analytics). — ISBN10: 0134024141. — ISBN13: 978-0134024141. As adoption of Hadoop accelerates in the enterprise and beyond, there's soaring demand for those who can solve real-world problems by applying advanced data science techniques in Hadoop environments. Now Practical Data Science with Hadoop® and Spark...
  • No. 4
  • 9.30 MB
O
Packt Publishing, 2013. — 316 p. — ISBN: 978-1-78439-550-6. Helping developers become more comfortable and proficient with solving problems in the Hadoop space. People will become more familiar with a wide variety of Hadoop related tools and best practices for implementation. Hadoop Real-World Solutions Cookbook will teach readers how to build solutions using tools such as...
  • No. 5
  • 1.63 MB
S
Packt Publishing, 2015. — 222 p. — ISBN: 978-1-78528-899-9. Integrate Elasticsearch into Hadoop to effectively visualize and analyze your data. The Hadoop ecosystem is a de facto standard for processing terabytes and petabytes of data. Lucene-enabled Elasticsearch is becoming an industry standard for its full-text search and aggregation capabilities. Elasticsearch-Hadoop...
  • No. 6
  • 2.63 MB
Packt Publishing, 2015. — 100 p. — ISBN: 978-1-78328-155-8. Get to grips with the intricacies of Hadoop monitoring using the power of Ganglia and Nagios. With the exponential growth of data and many enterprises crunching more and more data, Hadoop as a data platform has gained a lot of popularity. The Hadoop platform needs to be monitored with respect to how it works and...
  • No. 7
  • 1.39 MB
T
Packt Publishing, 2015. — 518 p. — ISBN10: 1783285516, ISBN13: 9781783285518. The code examples for the book are posted here. This book introduces you to the world of building data-processing applications with the wide variety of tools supported by Hadoop 2. Starting with the core components of the framework—HDFS and YARN—this book will guide you through how to build applications using a...
  • No. 8
  • 1.88 MB
W
4th Edition. — O’Reilly, 2015. — 805 p. — ISBN: 1491901632. Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and...
  • No. 9
  • 4.21 MB