Skip to content

Tag: CDH

Interesting Docker Containers (and some tips on running them)

July 20, 2018 by edpflager

I’ve been learning how to use Docker for the last couple of months. Part of that experience has been downloading and working with various freely available containers from the Docker Hub. Since an ever increasing number applications are web based (i.e using a website as a UI tool) porting many open source projects to use Docker is …

Continue Reading

Connecting Kettle to Cloudera Hadoop Impala

July 20, 2018 by edpflager

As Big Data platforms like Hadoop and its ecosystem of related applications has matured, they have moved beyond the original key-value model to embrace data processing of more traditional structured data. But a big problem for DBAs and Data Analysts wanting to use the power of these new platforms to analyze data from RDBMS systems …

Continue Reading

Setup a Single-node Hadoop Yarn machine using CDH5 – Part 4

July 20, 2018 by edpflager

This is part 4 of a series about setting up a single-node Hadoop Yarn system for sandbox use. Part 1 was here, part 2 here, and part 3 here. I have another series for using MapReduceV1, which is here. I’m hoping to keep this series in a similar order as the original set of articles, and will deviate …

Continue Reading

Setup a Single-node Hadoop Yarn machine using CDH5 – Part 3

July 20, 2018 by edpflager

This is part 3 of a series about setting up a single-node Hadoop Yarn system for sandbox use. Part 1 was here, and part 2 here. I have another series for using MapReduceV1, which is here. I’m hoping to keep this series in a similar order as the original set of articles, and will deviate only when necessary. …

Continue Reading

Setup a Single-node Hadoop Yarn machine using CDH5 – Part 2

July 20, 2018 by edpflager

This is part 2 of setting up a single-node Hadoop Yarn system for sandbox use. Part 1 was here, or for the series for using MapReduceV1, go here. I’m hoping to keep this series in a similar order as the original set of articles, and will deviate only when necessary. All the content here is based …

Continue Reading

Setup a Single-node Hadoop Yarn machine using CDH5 – Part 1

July 20, 2018 by edpflager

Previously I posted a series of articles that walked through installing a single-node Hadoop machine using version 1 of MapReduce. Since that  series went live, a good deal of development has moved on to MRv2 aka YARN. I won’t go into the intricacies of the new architecture, suffice to say that it aims to be more efficient …

Continue Reading

Posts navigation

  • 1
  • 2
  • 3
  • 4
  • Next

Recent Posts

  • Diagram SQL Server Graph Databases in R – Part 4
  • Diagram SQL Server Graph Databases in R – Part 3
  • Diagram SQL Server Graph Databases in R – Part 2.5
  • Diagram SQL Server Graph Databases in R – Part 2
  • Diagram SQL Server Graph Databases in R – Part 1

Categories

Archives

RSS BI news

  • Query Generation in R
  • Visualizing Language Loss in Taiwan: Create an “Age-Sex Pyramid of Language” with ggplot2
  • Verbose data.table and uncovering hidden cedta’s data table awareness decisions
  • Weather Forecast from MET Office
  • Geocoding function

RSS R related news

  • Query Generation in R
  • Visualizing Language Loss in Taiwan: Create an “Age-Sex Pyramid of Language” with ggplot2
  • Verbose data.table and uncovering hidden cedta’s data table awareness decisions
  • Weather Forecast from MET Office
  • Geocoding function

Tags

Ambassador Bridge Big Data CDH centos Cloudera cookbook Docker ETL external article goofy graph database guides Hadoop HortonWorks How-to howto HUE humor impala inspiration install kettle LaTax LDAP Linux Mac metadata Mint MySQL nat: HNS failed nginx PDI Pentaho photogs R R Markdown RStudio rvest SQL Server SysAdmin technical Ubuntu Windows Windsor YARN
© 2019 | WordPress Theme by Superbthemes