HOME PAGE>   Hands-on Cases
2016-3-18 品格翻译为某知名IT公司提供翻译服务。翻译语种:英译中。翻译内容:使用Memblaze® PBlaze闪存卡提升Hadoop性能。
译文摘录(保密内容以XXX代替):

       
We live in the data age. It’s not easy to measure the total volume of data stored electronically, but an IDC estimate put the size of the “digital universe” at 4.4 zettabytes in 2013 and is forecasting a tenfold growth by 2020 to 44 zettabytes. So how to organize those sheer volume of data being generated every year effectively? Processing massive amounts of data requires a parallel compute and storage infrastructure, thus the parallel processing, scalable and reliable ability which Hadoop provides. Hadoop utilizes the powerful Hadoop Distributed File System (HDFS) which is a highly scalable, parallel file system optimized for very large sequential data sets running on clusters of commodity hardware.

我们现在生活在数据时代。测量电子数据储存总量并非易事。但在2013年,人们通过IDC估算出全世界储存的电子数据总量为4.4泽字节,预计到2020年,该数字将会增长十倍,达到44泽字节。那么,我们应该如何有效地整理如此大量的数据呢?更何况数据总量每年都在增加。我们需要通过并行计算和存储基础设施来处理大量数据。根据这项需求,Hadoop提供了并行处理功能和可靠的信息扩缩功能。为达成这一目标,Hadoop利用了强大的Hadoop分布式文件系统(HDFS),这是一种优化的扩缩性强的并行文件系统,非常适合在商用硬件集群上运行大量序列数据集时使用。

How to improve HDFS performance? Add more nodes, add more disks? No, Memblaze PCIe SSD offers better performance with less nodes. 3 data nodes, each with one SSD performance is better than 7 data nodes, each with 6 HDDs.
如何提高HDFS的性能?是该增加更多节点,还是增加更多磁盘?都不是。Memblaze PCIe闪存卡能够以很少的节点提供更好的性能。3个数据节点(每个节点配一个SSD)比7个数据节点(每个节点配6个硬盘驱动器)性能更佳。

Benchmark Procedure
基准程序

The test process aims at evaluate HDFS performance with PCIe SSD.
本测试的目的是评估配备PCIe SSD的HDFS性能。

TestDFSIO is used to measure performance of HDFS and stress both network and IO subsystems. The command read and write files in HDFS which is useful in measuring system-wide performance and exposing network bottlenecks on the NameNode and DataNodes. A majority of MapReduce workloads are IO bound more than compute and hence TestDFSIO can provide an accurate initial picture of such scenarios.
TestDFSIO的作用是测量网络和IO子系统的HDFS性能。HDFS的读取/写入文件命令可用于测量整个系统的性能,并揭露Name节点和数据节点的网络瓶颈。大部分映射化简计算模式(MapReduce)工作负载都属于IO密集型(IO bound),而非计算密集型,所以TestDFSIO可以对此情况作出准确的初步描述。

上一篇:2016-3-8 品格翻译为某文化公司提供翻译服务。翻译语种:中译阿(阿拉伯语)。翻译内容:中国传统文化(《龙龛手鉴》)。
下一篇:2016-2-26 品格翻译继续为某知名律所提供翻译服务。翻译语种:中译英。翻译内容:法律行业新闻。