Нема описа

Gavin Halliday 81a8c86c88 Merge pull request #9360 from richardkchapman/issue16516 пре 8 година
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development пре 10 година
clienttools c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
cmake_modules f0280cd3e5 Merge branch 'candidate-6.0.8' into candidate-6.2.0 пре 8 година
common 81a8c86c88 Merge pull request #9360 from richardkchapman/issue16516 пре 8 година
configuration 07f9f9b999 Merge pull request #9220 from ghalliday/issue16412 пре 8 година
dali 7a5dd9456e Merge pull request #9290 from ghalliday/issue16546 пре 8 година
deploy c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
deployment f192561ac2 HPCC-8614 Improved ConfigMgr пре 8 година
docs ba5b70b05e Merge pull request #9343 from g-pan/H14025-LocalnLDAPAuth пре 8 година
ecl 81a8c86c88 Merge pull request #9360 from richardkchapman/issue16516 пре 8 година
ecllibrary fa2e97e93e HPCC-16655 Extract_Tri anomaly in eclblas пре 8 година
esp 53939bd7e7 Merge pull request #9319 from miguelvazq/HPCC-16188 пре 8 година
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' пре 12 година
initfiles eca6685050 Merge pull request #9278 from Michael-Gardner/HPCC-16527 пре 8 година
lib2 96610805d9 HPCC-16088 Excessive calls to install_name_tool пре 9 година
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project пре 12 година
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue пре 8 година
plugins 81a8c86c88 Merge pull request #9360 from richardkchapman/issue16516 пре 8 година
roxie 38f8a46131 HPCC-16625 Add options to control lifetime of global variables in embedded Python пре 8 година
rtl 1e9d33c614 Merge branch 'candidate-6.0.8' into candidate-6.2.0 пре 8 година
services 40a6995017 HPCC-16103 Removed logging from common/remote/rmtssh.cpp пре 9 година
system 5c2d0c181d HPCC-16434 Coverity: Resource leak пре 8 година
testing 38f8a46131 HPCC-16625 Add options to control lifetime of global variables in embedded Python пре 8 година
thorlcr 9b61760716 Merge pull request #9338 from richardkchapman/python-global-preserve пре 8 година
tools 2e8714e625 Signed-off-by: Russ Whitehead <william.whitehead@lexisnexis.com> пре 8 година
.gitattributes 936666bb1a HPCC-16584 Ensure run time script files are \n terminated пре 8 година
.gitignore bcc5a9e382 HPCC-15799 Add minimal linting пре 8 година
.gitmodules 1cdb916e70 HPCC-16522 Added libcouchbase as submodule and packaged alongside plugin пре 8 година
.travis.yml bcc5a9e382 HPCC-15799 Add minimal linting пре 8 година
BUILD_ME.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md пре 10 година
CMakeLists.txt dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue пре 8 година
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection пре 14 година
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing пре 8 година
FUTURE b39eb133f9 Initial version of FUTURE document пре 14 година
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package пре 9 година
README.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md пре 10 година
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release пре 9 година
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc пре 12 година
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR пре 9 година
sourcedoc.xml c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® пре 10 година
version.cmake 1e9d33c614 Merge branch 'candidate-6.0.8' into candidate-6.2.0 пре 8 година

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions