Apache HBase

Apache HBase is an open-source, distributed, versioned, column-oriented database modeled after Google's Bigtable. It provides Bigtable-like capabilities on top of Apache Hadoop for handling massive datasets with billions of rows and millions of columns.

Repository activity:

  • Stars: 5,346
  • Forks: 3,348
  • Watchers: 389
  • Open Issues: 316
  • Last commit: about 4 hours ago

Details:

  • Estimated Popularity: 29
  • Pricing Model: Free
  • Hosting Type: Self-Hosted
  • License: Apache-2.0
  • Deployment Difficulty: Advanced
  • Language: Java

Apache HBase is a distributed, scalable big data store for very large tables, on the order of billions of rows and millions of columns. It runs on top of HDFS (Hadoop Distributed File System) and provides random, real-time read/write access to the data stored there.

Key Features

  • Distributed Architecture:

    • Linear and modular scalability
    • Automatic sharding of tables into regions
    • Strongly consistent reads and writes
    • Region servers that host and serve table regions
    • ZooKeeper quorum for cluster coordination
  • Data Model:

    • Column-family-oriented storage
    • Schema-flexible tables
    • Versioned data cells
    • Sparse data handling
    • Row-level atomicity
  • Performance:

    • In-memory write buffering (MemStore)
    • Block cache for hot data
    • Bloom filters to skip store files during lookups
    • Compaction for storage optimization
    • Write-ahead logging for durability
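
The data model items above (sparse rows, versioned cells, sorted row keys) can be pictured as a nested sorted map, which is how the Bigtable design describes its tables. The following is a minimal single-process sketch of that idea in plain Java. It is not the HBase client API: the class `ToyWideColumnTable` and its methods `put`, `get`, `getVersions`, and `scan` are hypothetical names chosen for illustration, and the sketch ignores distribution, persistence, and column-family configuration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;

/** Toy in-memory model of a Bigtable-style table. Hypothetical, NOT the HBase client API. */
final class ToyWideColumnTable {
    // row key -> "family:qualifier" -> timestamp (newest first) -> value
    private final NavigableMap<String, NavigableMap<String, NavigableMap<Long, String>>> rows =
            new TreeMap<>();
    private final int maxVersions;

    ToyWideColumnTable(int maxVersions) {
        if (maxVersions < 1) {
            throw new IllegalArgumentException("maxVersions must be >= 1");
        }
        this.maxVersions = maxVersions;
    }

    /** Writes one cell version. Only cells that are written occupy space (sparse rows). */
    void put(String row, String column, long timestamp, String value) {
        NavigableMap<Long, String> versions = rows
                .computeIfAbsent(row, r -> new TreeMap<>())
                .computeIfAbsent(column, c -> new TreeMap<Long, String>(Comparator.reverseOrder()));
        versions.put(timestamp, value);
        // Timestamps are sorted newest first, so the last entry is the oldest version.
        while (versions.size() > maxVersions) {
            versions.pollLastEntry();
        }
    }

    /** Returns the newest value of a cell, or null if the cell was never written. */
    String get(String row, String column) {
        List<String> newest = getVersions(row, column, 1);
        return newest.isEmpty() ? null : newest.get(0);
    }

    /** Returns up to `limit` values of a cell, newest first. */
    List<String> getVersions(String row, String column, int limit) {
        List<String> out = new ArrayList<>();
        NavigableMap<String, NavigableMap<Long, String>> columns = rows.get(row);
        if (columns == null || !columns.containsKey(column)) {
            return out;
        }
        for (String value : columns.get(column).values()) {
            if (out.size() == limit) {
                break;
            }
            out.add(value);
        }
        return out;
    }

    /** Returns row keys in sorted order within [startRow, stopRow), like a range scan. */
    List<String> scan(String startRow, String stopRow) {
        return new ArrayList<>(rows.subMap(startRow, true, stopRow, false).keySet());
    }

    public static void main(String[] args) {
        ToyWideColumnTable table = new ToyWideColumnTable(2);
        table.put("user#42", "info:email", 1L, "old@example.com");
        table.put("user#42", "info:email", 2L, "new@example.com");
        table.put("user#07", "info:name", 1L, "Ada");
        System.out.println(table.get("user#42", "info:email"));
        System.out.println(table.getVersions("user#42", "info:email", 5));
        System.out.println(table.scan("user#00", "user#10"));
    }
}
```

Because row keys are kept sorted, a range scan is a simple sub-map view, which hints at why row key design matters so much in a store of this kind. In a real deployment the sorted key space is split into regions and served by region servers; this sketch keeps everything in one map.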
