Keine Beschreibung

Jake Smith 0798b66d94 HPCC-16582 Ensure logical filename issued on index read error vor 8 Jahren
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development vor 10 Jahren
clienttools c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
cmake_modules 13e17b9b9d Merge pull request #9265 from xwang2713/HPCC-16519-clienttools-lib-path vor 8 Jahren
common f5acbc365d HPCC-16537 Keep track of the number of retries accessing remote files vor 8 Jahren
configuration 07f9f9b999 Merge pull request #9220 from ghalliday/issue16412 vor 8 Jahren
dali faba669504 Merge pull request #9293 from ghalliday/issue16445 vor 8 Jahren
deploy c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
deployment f192561ac2 HPCC-8614 Improved ConfigMgr vor 8 Jahren
docs 7e618cc864 Merge branch 'candidate-6.0.8' into candidate-6.2.0 vor 8 Jahren
ecl 1eefc95e26 Merge pull request #9261 from ghalliday/issue16503 vor 8 Jahren
ecllibrary 80bac9909c HPCC-16409 Regression suite failures from BLAS library selftests vor 8 Jahren
esp 14015a6f71 Merge remote-tracking branch 'origin/candidate-6.2.0' vor 8 Jahren
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' vor 12 Jahren
initfiles fa1877391a HPCC-16535 INIT_PID_NAME missing from init_thor vor 8 Jahren
lib2 96610805d9 HPCC-16088 Excessive calls to install_name_tool vor 9 Jahren
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project vor 12 Jahren
plugins 14015a6f71 Merge remote-tracking branch 'origin/candidate-6.2.0' vor 8 Jahren
roxie 980df3eaa1 HPCC-16529 Protect update of stats with a critical section vor 8 Jahren
rtl db37e243b4 HPCC-16385 Add support for number of allocation scans statistic vor 8 Jahren
services 40a6995017 HPCC-16103 Removed logging from common/remote/rmtssh.cpp vor 9 Jahren
system def51a3ebc Merge remote-tracking branch 'origin/candidate-6.2.0' vor 8 Jahren
testing 88fdd04923 HPCC-16503 Generate constant inline child datasets efficiently vor 8 Jahren
thorlcr 0798b66d94 HPCC-16582 Ensure logical filename issued on index read error vor 8 Jahren
tools 2e8714e625 Signed-off-by: Russ Whitehead <william.whitehead@lexisnexis.com> vor 8 Jahren
.gitattributes 7f4953af04 Issue #254 Switches template reading to use jlib vor 14 Jahren
.gitignore bcc5a9e382 HPCC-15799 Add minimal linting vor 9 Jahren
.gitmodules edc2fae157 HPCC-15612 Target hpcc-systems' fork of libcouchbase-cxx vor 9 Jahren
.travis.yml bcc5a9e382 HPCC-15799 Add minimal linting vor 9 Jahren
BUILD_ME.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md vor 10 Jahren
CMakeLists.txt 250914e804 HPCC-16467 Fix configurator file collision issue when installing plugin packages vor 8 Jahren
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection vor 14 Jahren
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing vor 9 Jahren
FUTURE b39eb133f9 Initial version of FUTURE document vor 14 Jahren
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package vor 9 Jahren
README.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md vor 10 Jahren
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release vor 10 Jahren
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc vor 12 Jahren
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR vor 9 Jahren
sourcedoc.xml c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® vor 10 Jahren
version.cmake e36808d85d Split off 6.2.0 vor 8 Jahren

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions