설명 없음

Richard Chapman b51f2df68b Merge pull request #10344 from ghalliday/issue17730 7 년 전
.github f35ba39077 HPCC-17012 Add github PULL_REQUEST_TEMPLATE.md file 8 년 전
build_utils c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 년 전
charm 37d0fb7f7d HPCC-11289 Add README file for HPCC Juju Charm Development 10 년 전
clienttools ae0b7dfdf3 HPCC-18164 Handle invalid ESDL syntax when generating ECL 7 년 전
cmake_modules 365b180489 HPCC-18089 Add directory exists guard for FILE() calls in find_package scripts 7 년 전
common 579d9270b3 Merge pull request #10277 from mckellyln/hpcc-18007 7 년 전
configuration 82eeff61fa HPCC-17632 Add process "required" entry in AppInfo 8 년 전
dali f244f261a4 HPCC-18007 Dfuplus spray to dafilesrv with SSLFirst problem 7 년 전
deploy c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 년 전
deployment 6a4d5ebc79 Merge pull request #9993 from Michael-Gardner/HPCC-17629 8 년 전
docs cdb965236a Merge branch 'candidate-6.4.x' into candidate-6.4.2 7 년 전
ecl d9a8b9b44d HPCC-18000 Do not require macro parameters when syntax checking 8 년 전
ecllibrary 44507cf94d Merge pull request #10111 from dcamper/hpcc-17802-std-date-typo 8 년 전
esp b95ce2e085 Merge pull request #10336 from miguelvazq/HPCC-17970 7 년 전
githooks 4ba38f7d01 Merge remote-tracking branch 'origin/candidate-3.10.x' 12 년 전
initfiles 7a0d3de6ec HPCC-18038 Remove the version option from the publish and java command 7 년 전
lib2 23789dc506 HPCC-16745 Include zlib.dll in platform and clienttools packages 8 년 전
misc 66e814cdf0 HPCC-9508 Add eclipse code layout settings file to project 12 년 전
package dd10b3c3be HPCC-16491 Work-around CMake productbuild packaging issue 8 년 전
plugins 23612b373e HPCC-18173 Couchbase Plugin: Solve various memory leaks 7 년 전
roxie b51f2df68b Merge pull request #10344 from ghalliday/issue17730 7 년 전
rtl 3f7490e112 HPCC-17974 Support xpath('<>') in soapcall response layout 8 년 전
services 3914058968 HPCC-17246 Logging improvements in frunssh and thor 8 년 전
system e9e59d3315 Merge pull request #10326 from mckellyln/nfy_link 7 년 전
testing f23e47fec7 Merge pull request #10243 from rpastrana/HPCC-17930couchbase-simple 7 년 전
thorlcr 932c06d73e HPCC-18001 Mark disk read activities as 'fastThrough' 8 년 전
tools 419fbb58b3 Merge pull request #10316 from mayx/HPCC-17859 7 년 전
.gitattributes 2241da2000 HPCC-17425 Various fixes for running HPCC in windows linux subsystem 8 년 전
.gitignore bcc5a9e382 HPCC-15799 Add minimal linting 8 년 전
.gitmodules d20d54004b HPCC-17600 SQS Plugin modifications 8 년 전
.travis.yml bcc5a9e382 HPCC-15799 Add minimal linting 8 년 전
BUILD_ME.md fb5d21dc72 HPCC-13515 A proper README.md, moving old README.md -> BUILD_ME.md 10 년 전
CMakeLists.txt 7d5060380d HPCC-17958 Do not turn off "USE_APR" when "INCLUDE_PLUGINS" enabled 8 년 전
CNAME 996619b9ea Add CNAME entry for GitHub pages redirection 14 년 전
CONTRIBUTORS c04b2a38e0 HPCC-16014 Contributors file needs some refreshing 8 년 전
FUTURE b39eb133f9 Initial version of FUTURE document 13 년 전
LICENSE.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 년 전
R-LICENSE.txt 41fdcf477c HPCC-14457 Split R plugin to its own package 9 년 전
README.md 6041fee721 Merge pull request #10169 from richardkchapman/readme-links 8 년 전
VERSIONS 04760b84cc Preparation for 6.0.0-beta1 release 9 년 전
baseaddr.txt c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 년 전
build-config.h.cmake 08fd95330b HPCC-9902 Use the build version as the ecl version reported by eclcc 11 년 전
cmake_uninstall.cmake.in 2ae8bdcd44 HPCC-15142 Minimal changes needed for DESTDIR 8 년 전
sourcedoc.xml c63b80c278 HPCC-13448 Source Code needs Marca Registrada next to HPCC Systems® 9 년 전
version.cmake 57fd5cf454 Split off candidate-6.4.2 7 년 전

README.md

Description / Rationale

HPCC Systems offers an enterprise ready, open source supercomputing platform to solve big data problems. As compared to Hadoop, the platform offers analysis of big data using less code and less nodes for greater efficiencies and offers a single programming language, a single platform and a single architecture for efficient processing. HPCC Systems is a technology division of LexisNexis Risk Solutions.

Getting Started

Architecture

The HPCC Systems architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. An HPCC environment can include only Thor clusters, or both Thor and Roxie clusters. Each of these cluster types is described in more detail in the following sections below the architecture diagram.

Thor

Thor (the Data Refinery Cluster) is responsible for consuming vast amounts of data, transforming, linking and indexing that data. It functions as a distributed file system with parallel processing power spread across the nodes. A cluster can scale from a single node to thousands of nodes.

  • Single-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for Extraction, Transformation, Loading, Sorting, Indexing and Linking
  • Scales from 1-1000s of nodes

Roxie

Roxie (the Query Cluster) provides separate high-performance online query processing and data warehouse capabilities. Roxie (Rapid Online XML Inquiry Engine) is the data delivery engine used in HPCC to serve data quickly and can support many thousands of requests per node per second.

  • Multi-threaded
  • Distributed parallel processing
  • Distributed file system
  • Powerful parallel processing programming language (ECL)
  • Optimized for concurrent query processing
  • Scales from 1-1000s of nodes

ECL

ECL (Enterprise Control Language) is the powerful programming language that is ideally suited for the manipulation of Big Data.

  • Transparent and implicitly parallel programming language
  • Non-procedural and dataflow oriented
  • Modular, reusable, extensible syntax
  • Combines data representation and algorithm implementation
  • Easily extend using C++ libraries
  • ECL is compiled into optimized C++

ECL IDE

ECL IDE is a modern IDE used to code, debug and monitor ECL programs.

  • Access to shared source code repositories
  • Complete development, debugging and testing environment for developing ECL dataflow programs
  • Access to the ECLWatch tool is built-in, allowing developers to watch job graphs as they are executing
  • Access to current and historical job workunits

ESP

ESP (Enterprise Services Platform) provides an easy to use interface to access ECL queries using XML, HTTP, SOAP and REST.

  • Standards-based interface to access ECL functions

Developer documentation

The following links describe the structure of the system and detail some of the key components: