Microsoft big data solutions / Adam Jorgensen ...
Contributor(s): Resource type: Ressourcentyp: Buch (Online)Book (Online)Language: English Publisher: New York : John Wiley & Sons, Incorporated, 2014Edition: Online-AusgDescription: Online-Ressource (1 online resource (xix, 388 pages)) : illustrationsISBN:- 9781306472975
- 1306472970
- 9781118742099
- 005.74
- QA76.9.F5
- TK5105.88813
Contents:
Summary: Cover -- Title Page -- Copyright -- Contents -- Introduction -- Part I What Is Big Data? -- Chapter 1 Industry Needs and Solutions -- What's So Big About Big Data? -- A Brief History of Hadoop -- Google -- Nutch -- What Is Hadoop? -- Derivative Works and Distributions -- Hadoop Distributions -- Core Hadoop Ecosystem -- Important Apache Projects for Hadoop -- The Future for Hadoop -- Summary -- Chapter 2 Microsoft's Approach to Big Data -- A Story of "Better Together" -- Competition in the Ecosystem -- SQL on Hadoop Today -- Hortonworks and Stinger -- Cloudera and Impala -- Microsoft's Contribution to SQL in Hadoop -- Deploying Hadoop -- Deployment Factors -- Deployment Topologies -- Deployment Scorecard -- Summary -- Part II Setting Up for Big Data with Microsoft -- Chapter 3 Configuring Your First Big Data Environment -- Getting Started -- Getting the Install -- Running the Installation -- On-Premise Installation: Single-Node Installation -- HDInsight Service: Installing in the Cloud -- Windows Azure Storage Explorer Options -- Validating Your New Cluster -- Logging into HDInsight Service -- Verify HDP Functionality in the Logs -- Common Post-Setup Tasks -- Loading Your First Files -- Verifying Hive and Pig -- Summary -- Part III Storing and Managing Big Data -- Chapter 4 HDFS, Hive, HBase, and HCatalog -- Exploring the Hadoop Distributed File System -- Explaining the HDFS Architecture -- Interacting with HDFS -- Exploring Hive: The Hadoop Data Warehouse Platform -- Designing, Building, and Loading Tables -- Querying Data -- Configuring the Hive ODBC Driver -- Exploring HCatalog: HDFS Table and Metadata Management -- Exploring HBase: An HDFS Column-Oriented Database -- Columnar Databases -- Defining and Populating an HBase Table -- Using Query Operations -- Summary -- Chapter 5 Storing and Managing Data in HDFS.PPN: PPN: 787517909Package identifier: Produktsigel: ZDB-26-MYL | ZDB-30-PAD | ZDB-30-PQE
Cover; Title Page; Copyright; Contents; Introduction; Part I What Is Big Data?; Chapter 1 Industry Needs and Solutions; What's So Big About Big Data?; A Brief History of Hadoop; Google; Nutch; What Is Hadoop?; Derivative Works and Distributions; Hadoop Distributions; Core Hadoop Ecosystem; Important Apache Projects for Hadoop; The Future for Hadoop; Summary; Chapter 2 Microsoft's Approach to Big Data; A Story of "Better Together"; Competition in the Ecosystem; SQL on Hadoop Today; Hortonworks and Stinger; Cloudera and Impala; Microsoft's Contribution to SQL in Hadoop; Deploying Hadoop
Deployment FactorsDeployment Topologies; Deployment Scorecard; Summary; Part II Setting Up for Big Data with Microsoft; Chapter 3 Configuring Your First Big Data Environment; Getting Started; Getting the Install; Running the Installation; On-Premise Installation: Single-Node Installation; HDInsight Service: Installing in the Cloud; Windows Azure Storage Explorer Options; Validating Your New Cluster; Logging into HDInsight Service; Verify HDP Functionality in the Logs; Common Post-Setup Tasks; Loading Your First Files; Verifying Hive and Pig; Summary; Part III Storing and Managing Big Data
Chapter 4 HDFS, Hive, HBase, and HCatalogExploring the Hadoop Distributed File System; Explaining the HDFS Architecture; Interacting with HDFS; Exploring Hive: The Hadoop Data Warehouse Platform; Designing, Building, and Loading Tables; Querying Data; Configuring the Hive ODBC Driver; Exploring HCatalog: HDFS Table and Metadata Management; Exploring HBase: An HDFS Column-Oriented Database; Columnar Databases; Defining and Populating an HBase Table; Using Query Operations; Summary; Chapter 5 Storing and Managing Data in HDFS; Understanding the Fundamentals of HDFS; HDFS Architecture
NameNodes and DataNodesData Replication; Using Common Commands to Interact with HDFS; Interfaces for Working with HDFS; File Manipulation Commands; Administrative Functions in HDFS; Moving and Organizing Data in HDFS; Moving Data in HDFS; Implementing Data Structures for Easier Management; Rebalancing Data; Summary; Chapter 6 Adding Structure with Hive; Understanding Hive's Purpose and Role; Providing Structure for Unstructured Data; Enabling Data Access and Transformation; Differentiating Hive from Traditional RDBMS Systems; Working with Hive; Creating and Querying Basic Tables
Creating DatabasesCreating Tables; Adding and Deleting Data; Querying a Table; Using Advanced Data Structures with Hive; Setting Up Partitioned Tables; Loading Partitioned Tables; Using Views; Creating Indexes for Tables; Summary; Chapter 7 Expanding Your Capability with HBase and HCatalog; Using HBase; Creating HBase Tables; Loading Data into an HBase Table; Performing a Fast Lookup; Loading and Querying HBase; Managing Data with HCatalog; Working with HCatalog and Hive; Defining Data Structures; Creating Indexes; Creating Partitions; Integrating HCatalog with Pig and Hive
Using HBase or Hive as a Data Warehouse
No physical items for this record