The Baidu File System (BFS) is a distributed file system designed to support real-time applications. Like many other distributed file systems, BFS is highly fault-tolerant. But different from others, BFS provides low read/write latency while maintaining high throughput rates. Together with Galaxy and Tera, BFS supports many real-time products in Baidu, including Baidu webpage database, Baidu incremental indexing system, Baidu user behavior analysis system, etc.
Features
Continuous availability
Nameserver is implemented as a raft group, no single point failure.
High throughput
High performance data engine to maximize IO utils.
Low latency
Global load balance and slow node detection.
Linear scalability
Support multi data center deployment and up to 10,000 data nodes.
The Baidu File System
The Baidu File System (BFS) is a distributed file system designed to support real-time applications. Like many other distributed file systems, BFS is highly fault-tolerant. But different from others, BFS provides low read/write latency while maintaining high throughput rates. Together with Galaxy and Tera, BFS supports many real-time products in Baidu, including Baidu webpage database, Baidu incremental indexing system, Baidu user behavior analysis system, etc.
Features
raft group, no single point failure.Architecture
Quick Start
Build
Standalone BFS
How to Contribute
make testandmake check.Contact us
opensearch@baidu.com
====
百度文件系统
百度的核心业务和数据库系统都依赖分布式文件系统作为底层存储,文件系统的可用性和性能对上层搜索业务的稳定性与效果有着至关重要的影响。现有的分布式文件系统(如HDFS等)是为离线批处理设计的,无法在保证高吞吐的情况下做到低延迟和持续可用,所以我们从搜索的业务特点出发,设计了百度文件系统。
核心特点
架构
快速试用
构建
单机版BFS
如何参与开发
联系我们
邮件:opensearch@baidu.com
QQ群:188471131