CDH3 update 4 is now available
We are happy to officially announce the general availability of CDH3 update 4. This update consists primarily of reliability enhancements as well as a number of minor improvements. First, there have...
View ArticleCloudera Manager 4.0 Beta released
We’re happy to announce the Beta release of Cloudera Manager 4.0. This version of Cloudera Manager includes support for CDH4 Beta2 and several new features for both the Free edition and the Enterprise...
View ArticleMeet the Presenter: Todd Lipcon
Today’s interview features Todd Lipcon, software engineer for Cloudera. Todd will be presenting Optimizing MapReduce Job Performance at Hadoop Summit. Question: Tell us about your current role and how...
View ArticleApache HBase 0.94 is now released
Apache HBase 0.94.0 has been released! This is the first major release since the January 22nd HBase 0.92 release. In the HBase 0.94.0 release the main focuses were on performance enhancements and the...
View ArticleApache MRUnit Is Now A Top Level Project
This posted was originally posted to the Apache Software Foundation MRUnit blog. The Apache MRUnit team has graduated from the Apache Incubator to an Apache TLP (Top Level Project)! MRUnit is a Java...
View ArticleNameNode Recovery Tools for the Hadoop Distributed File System
Most system administrators have had to deal with a bad hard disk at some point. One moment, the hard disk is a mechanical marvel; the next, it is an expensive paperweight. The HDFS (Hadoop Distributed...
View ArticleCloudera Manager 3.7.6 released!
We are pleased to announce that Cloudera Manager 3.7.6 is now available! The most notable updates in this release are: Support for multiple Hue service instances Separating RPC queue and processing...
View ArticleOnline HBase Backups with CopyTable
CopyTable is a simple Apache HBase utility that, unsurprisingly, can be used for copying individual tables within an HBase cluster or from one HBase cluster to another. In this blog post, we’ll talk...
View ArticleHue 2.0 Packs New Features in a Refreshing UI
Hue 2.0.1 has just been released. 2.0.1 represents major improvement on top of the Hue 1.x series. To list a few key new features: Frontend has been re-implemented as full screen pages. Hue supports...
View ArticleCDH4 and Cloudera Enterprise 4.0 Now Available
I’m very pleased to announce the immediate General Availability of CDH4 and Cloudera Manager 4 (part of the Cloudera Enterprise 4.0 subscription). These releases are an exciting milestone for Cloudera...
View ArticleThe Singularity: HBase Compatibility and Extensibility
Overview One of the major features of the upcoming Apache HBase 0.96 release is improved support for compatibility and extensibility across different HBase versions. This includes support for the...
View ArticleThe Elephant in the Enterprise
On Tuesday, June 12th The Churchill Club of Silicon Valley hosted a panel discussion on Hadoop’s evolution from an open-source project to becoming a standard component of today’s enterprise computing...
View ArticleHBase Write Path
Apache HBase is the Hadoop database, and is based on the Hadoop Distributed File System (HDFS). HBase makes it possible to randomly access and update data stored in HDFS, but files in HDFS can only be...
View ArticleA Big Thank You to All Who Participated In Making HBaseCon and the HBase...
HBaseCon 2012 summation provided by Michael Stack, PMC Chair of the Apache HBase Project. HBase Hack-a-thon summation provided by David Wang, Engineering Manager for the Cloudera HBase team. HBaseCon...
View ArticleHBase I/O – HFile
Introduction Apache HBase is the Hadoop open-source, distributed, versioned storage manager well suited for random, realtime read/write access. Wait wait? random, realtime read/write access?How is that...
View ArticleUpdate on Apache Bigtop (incubating)
Introduction Ever since Cloudera decided to contribute the code and resources for what would later become Apache Bigtop (incubating), we’ve been answering a very basic question: what exactly is Bigtop...
View ArticleApache Flume Development Status Update
Apache Flume is a scalable, reliable, fault-tolerant, distributed system designed to collect, transfer, and store massive amounts of event data into HDFS. Apache Flume recently graduated from the...
View ArticleThe Hadoop Ecosystem, Visualized in Datameer
This is a guest re-post from Datameer’s Director of Marketing, Rich Taylor. The original post can be found on the Datameer blog. Datameer uses D3.js to power our Business Infographic™ designer. I...
View ArticleWatching the Clock: Cloudera’s Response to Leap Second Troubles
At 5 pm PDT on June 30, a leap second was added to the Universal Coordinated Time (UTC). Within an hour, Cloudera Support started receiving reports of systems running at 100% CPU utilization. The...
View ArticleHBase Log Splitting
In the recent blog post about the HBase Write Path, we talked about the write-ahead-log (WAL), which plays an important role in preventing data loss should a HBase region server failure occur. This...
View Article
More Pages to Explore .....