Hardware 【硬件】
Available Configurations 【可用配置】
File Systems 【文件系统】
Core Software 【核心软件】
Additional Software【附加软件】
Tools 【工具】
About the HPCC Distributed File System【关于HPCC分布式文件系统】
The Thor DFS is record-oriented using a local Linux file system to store file parts. Files are initially loaded (sprayed) across nodes and each node has a single file part which can be empty for each distributed file. 【Thor是使用linxu本地文件系统存储文件的、记录的分布式文件系统。被初始加载的文件跨节点,每个节点作为一个分布式文件的部分,为空。】 Files are divided on even record/document boundaries specified by the user. 【文件被用户指定分割为一致的的记录/文档界限】 Master/slave architecture with name services and file mapping information are stored on a separate server. Only one local file per node is required to represent a distributed file. 【使用名称服务的主/从架构和文件映射信息被存储在单独的服务器。每个节点只有一个文件被要求代表一个分布式的文件。】 Read/write access is supported between clusters configured in the same environment. 【读/写访问被配置为同样环境的机器间支持】 Utilizing special adapters allow files from external databases such as MySQL to be accessed, allowing transactional data to be integrated with DFS data and incorporated into batch jobs. 【利用特殊的适配器可允许来自外部的数据、如访问MySQL,允许交易数据与使用DFS数据和合成为成批处理作业中的数据进行整合。】The Roxie DFS utilizes distributed B+Tree index files containing key information and data stored in local files on each node. 【Roxie利用分布式B+树索引文件涵盖关键信息和被存储在每个节点中的本地文件中的数据】
Redundancy【冗余】
The DFS for Thor and Roxie stores replicas of file parts on other configurable nodes to protect against disk and node failure.【对于Thor和Roxie分布式文件系统,在其他配置节点上存储复制信息以防止产品或节点故障】 The Thor system offers either automatic or manual node swap and warm-start following a node failure, and jobs are restarted from last checkpoint or persist. 【在一个节点出现故障时,Thor系统提供自动或手动的节点切换“温启动”,作业被从上一次的“checkpoint”点重新开始或一直运行】Replicas are automatically used while copying data to the new node. 【当拷贝数据到新的节点时,复制被自动进行】The Roxie system continues running following a node failure with a reduced number of nodes. 【在一个节点出现故障时,Roxie系统可在已减少的节点系统上持续运行】
Job Execution Environment【作业执行环境】
Thor utilizes a master/slave processing architecture.【Thor使用主/从处理架构】 Processing steps defined in an ECL job can specify local (data processed separately on each node) or global (data is processed across all nodes) operation. 【被定义在一个ECL作业的处理步骤可以指定本地(数据在每个节点被独立处理)或全局(数据被跨都有节点处理)操作】 Multiple processing steps for a procedure are executed automatically as part of a single job based on an optimized execution graph for a compiled ECL dataflow program. 【对于一个已编译过的数据流程序,作为一个基于执行优化图的作业的部分的一个过程的多个处理步骤是被自动执行的】A single Thor cluster can be configured to run multiple jobs concurrently reducing latency if adequate CPU and memory resources are available on each node. 【在每个节点上如果有足够的CPU和内存资源,一个Thor集群可以被配置为并行地运行多个作业且减少延迟】Middleware components including an ECLAgent, ECLServer, and Dali Server provide the client interface and manage execution of the job which is packaged as a workunit. 【包括ECLAgent、 ECLServer、 and Dali Serve的中间件组件提供客户端接口和管理着被打包为一个工作单元的作业执行】 Roxie utilizes a multiple server/agent architecture to process ECL programs accessed by queries using server tasks acting as a manager for each query and multiple agent tasks as needed to retrieve and process data for the query. 【Roxie采用多服务/代理架构,通过使用服务任务作为每个查询管理者、通过多重代理任务收集和处理查询处理的数据来处理被查询访问的ECL程序】
Programming Language【编程语言】
ECL is the primary programming language for the HPCC environment. 【ECL是HPCC环境下的主要程序设计语言】ECL is compiled into optimized C++ which is then compiled into DLLs for execution on the Thor and Roxie platforms.【ECL被编译到优化过的C++、被编译为Thor和Roxie平台下的动态库】 ECL can include inline C++ code encapsulated in functions. 【ECL可以包括内嵌C ++代码的功能封装】 External services can be written in any language and compiled into shared libraries of functions callable from ECL. 【外部服务可以使用任何语言并且可以便意味共享函数库被ECL调用】A pipe interface allows execution of external programs written in any language to be incorporated into jobs. 【一个管道接口允许执行外部任何语言编写的程序以整合到作业中】
Database Capabilities【数据库能力】
The HPCC platform includes the capability to build multi-key, multivariate indexes on DFS files. 【在分布式文件系统上,HPCC平台包括构建多关键字、多变量索引】These indexes can be used to improve performance and provide keyed access for batch jobs on a Thor system, or be used to support development of queries deployed to Roxie systems. 【那些索引可被用例改善性能、提供在Thor系统上的对于批处理作业的关键字访问、用于支持部署到Roxie系统的查询开发】Keyed access to data is supported directly in the ECL language. 【基于关键字访问数据被直接在ECL语言中支持】
Online Query and Data Warehouse Capabilities【在线查询和数据库仓库能力】
The Roxie system configuration in the HPCC platform is specifically designed to provide data warehouse capabilities for structured queries and data analysis applications. 【在HPCC平台上,Roxie系统配置设计为能提供数据仓库能力以应对结构化查询和数据分析型应用】Roxie is a high-performance platform capable of supporting thousands of users and providing sub-second response time depending on the application. 【Roxie是一个高性能平台有能力支持数千用户并且提供低于秒级的且依赖于应用的响应时间】