<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Hbase基础知识]]></title><description><![CDATA[<p dir="auto">1.Hbase表设计<br />
In HBase, you will find two different types of tables: the system tables and the user<br />
tables. Systems tables are used internally by HBase to keep track of meta information<br />
like the table’s access control lists (ACLs), metadata for the tables and regions, name‐<br />
spaces, and so on. There should be no need for you to look at those tables. User tables<br />
are what you will create for your use cases. They will belong to the default name‐<br />
space unless you create and use a specific one.</p>
<p dir="auto"><img src="/assets/uploads/files/1612691040475-883300c1-458a-4a46-9b9e-64ef5e802dcd-image.png" alt="883300c1-458a-4a46-9b9e-64ef5e802dcd-image.png" class=" img-responsive img-markdown" /><br />
一个具体例子：<br />
<img src="/assets/uploads/files/1612787696227-7f9d6d17-d183-4dd3-a08b-41b69aa19e88-image.png" alt="7f9d6d17-d183-4dd3-a08b-41b69aa19e88-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://www.tutorialspoint.com/hbase/hbase_create_data.htm" rel="nofollow ugc">https://www.tutorialspoint.com/hbase/hbase_create_data.htm</a></p>
<p dir="auto">Only columns where there is a value are stored in the underlying filesystem.<br />
<img src="/assets/uploads/files/1612691169975-fd740118-2eb7-40fc-b10f-c5a0fdd7c8ae-image.png" alt="fd740118-2eb7-40fc-b10f-c5a0fdd7c8ae-image.png" class=" img-responsive img-markdown" /><br />
tables are split into <strong>regions</strong> where each region will store a specific range<br />
of data. The regions are assigned to RegionServers to serve each region’s content.<br />
A <strong>column family</strong> is an HBase-specific concept that you will not find in other RDBMS applications. For the same region, different column families will store the data into different files and can be configured differently. Data with the same access pattern and the same format should be grouped into the same column family. As an example regarding the format, if you need to store a lot of textual metadata information for customer profiles in addition to image files for each customer’s profile photo, you might want to store them into two different column families: one compressed (where all the textual information will be stored), and one not compressed (where the image files will be stored). As an example regarding the access pattern, if some information is mostly read and almost never written, and some is mostly written and almost never read, you might want to separate them into two different column families. If the different columns you want to store have a similar format and access pattern, regroup them within the same column family.</p>
<p dir="auto">Stores<br />
We will find one store per column family. A store object regroups one memstore and zero or more store files (called HFiles). This is the entity that will store all the information written into the table and will also be used when data needs to be read from the table.</p>
<p dir="auto">HFiles<br />
HFiles are created when the memstores are full and must be flushed to disk. HFiles are eventually compacted together over time into bigger files. They are the HBase file format used to store table data. HFiles are composed of different types of blocks (e.g.,<br />
index blocks and data blocks). HFiles are stored in HDFS, so they benefit from Hadoop persistence and replication.</p>
<p dir="auto">Blocks<br />
HFiles are composed of blocks. Those blocks should not be confused with HDFS blocks. One HDFS block might contain multiple HFile blocks. HFile blocks are usually between 8 KB and 1 MB, but the default size is 64 KB. However, if compression is<br />
configured for a given table, HBase will still generate 64 KB blocks but will then compress them. The size of the compressed block on the disk might vary based on the data and the compression format. Larger blocks will create a smaller number of index values and are good for sequential table access, while smaller blocks will create more index values and are better for random read accesses.</p>
<p dir="auto"><img src="/assets/uploads/files/1612692198442-3e677184-4834-4fa7-be4b-c1e3165f2318-image.png" alt="3e677184-4834-4fa7-be4b-c1e3165f2318-image.png" class=" img-responsive img-markdown" /><br />
each row will be stored within a specific format. Figure 2-4 represents the format of an individual HBase cell.</p>
<p dir="auto">节点角色：<br />
<img src="/assets/uploads/files/1612692997169-2a977524-fe14-4eb2-a706-89f5f8f051c8-image.png" alt="2a977524-fe14-4eb2-a706-89f5f8f051c8-image.png" class=" img-responsive img-markdown" /><br />
Master Server<br />
• Region assignment<br />
• Load balancing<br />
• RegionServer recovery<br />
• Region split completion monitoring<br />
• Tracking active and dead servers</p>
<p dir="auto">Unlike HBase RegionServers, the HBase Master doesn’t have much workload and can　be installed on servers with less memory and fewer cores.Building HBase Masters (and other master services like NameNodes, ZooKeeper, etc.)on robust hardware with OS on RAID drives, dual power supply, etc. is highly recommended.</p>
<p dir="auto">RegionServer<br />
A RegionServer (RS) is the application hosting and serving the HBase regions and　therefore the HBase data.Even if it is technically doable to host more than one RegionServer on a physical host,it is recommended to run only one server per host and to give it the resources you　will have shared between the two servers.</p>
]]></description><link>http://an.forum.genostack.com/topic/211/hbase基础知识</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 10:43:03 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/211.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 07 Feb 2021 10:59:50 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Hbase基础知识 on Mon, 08 Feb 2021 07:17:28 GMT]]></title><description><![CDATA[<p dir="auto">Hbase 可靠性<br />
<img src="/assets/uploads/files/1612768604206-2ba0910e-6f34-4d99-bffe-1990c44160ef-image.png" alt="2ba0910e-6f34-4d99-bffe-1990c44160ef-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://www.simplilearn.com/tutorials/hadoop-tutorial/hbase" rel="nofollow ugc">https://www.simplilearn.com/tutorials/hadoop-tutorial/hbase</a></p>
]]></description><link>http://an.forum.genostack.com/post/411</link><guid isPermaLink="true">http://an.forum.genostack.com/post/411</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 08 Feb 2021 07:17:28 GMT</pubDate></item><item><title><![CDATA[Reply to Hbase基础知识 on Mon, 08 Feb 2021 06:56:23 GMT]]></title><description><![CDATA[<p dir="auto">Hbase架构<br />
<a href="https://data-flair.training/blogs/hbase-architecture/" rel="nofollow ugc">https://data-flair.training/blogs/hbase-architecture/</a></p>
]]></description><link>http://an.forum.genostack.com/post/410</link><guid isPermaLink="true">http://an.forum.genostack.com/post/410</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 08 Feb 2021 06:56:23 GMT</pubDate></item><item><title><![CDATA[Reply to Hbase基础知识 on Mon, 08 Feb 2021 06:58:53 GMT]]></title><description><![CDATA[<p dir="auto">Master和regions<br />
<img src="/assets/uploads/files/1612766258604-1a880af4-a3ec-461f-bfce-3267dffeac4c-image.png" alt="1a880af4-a3ec-461f-bfce-3267dffeac4c-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://dwgeek.com/hbase-architecture-components.html/" rel="nofollow ugc">https://dwgeek.com/hbase-architecture-components.html/</a></p>
<p dir="auto"><img src="/assets/uploads/files/1612767501752-07d6e61f-911b-4b6f-a5c2-c68a7b3f678e-image.png" alt="07d6e61f-911b-4b6f-a5c2-c68a7b3f678e-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://www.dummies.com/programming/big-data/hadoop/regionservers-in-hbase/" rel="nofollow ugc">https://www.dummies.com/programming/big-data/hadoop/regionservers-in-hbase/</a></p>
]]></description><link>http://an.forum.genostack.com/post/409</link><guid isPermaLink="true">http://an.forum.genostack.com/post/409</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 08 Feb 2021 06:58:53 GMT</pubDate></item><item><title><![CDATA[Reply to Hbase基础知识 on Mon, 08 Feb 2021 06:29:18 GMT]]></title><description><![CDATA[<p dir="auto">Table                    (HBase table)<br />
Region               (Regions for the table)<br />
Store            (Store per ColumnFamily for each Region for the table)<br />
MemStore     (MemStore for each Store for each Region for the table)<br />
StoreFile    (StoreFiles for each Store for each Region for the table)<br />
Block    (Blocks within a StoreFile within a Store for each Region for the table)</p>
]]></description><link>http://an.forum.genostack.com/post/408</link><guid isPermaLink="true">http://an.forum.genostack.com/post/408</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 08 Feb 2021 06:29:18 GMT</pubDate></item></channel></rss>