新永資訊有限公司


軟體資產管理家- 本公司可代為訂購國內外軟硬體
Tel:02-2597-1006   Fax:02-2597-1007

CART 6.0

研究分析軟體
Research & Analysis Software

歡迎來電洽詢
TEL: 02-2597-1006

需要產品報價?

CART 6.0 ProEX 資料挖掘分析軟體

 

 

 

 

CART是Salford Systems的旗艦數據挖掘軟件,該軟件是一款功能強大、易操作的決策樹,能自動篩選複雜的數據

Salford Systems' flagship data mining software, CART®, is a robust, easy-to-use decision tree that automatically sifts large, complex databases, searching for and isolating significant patterns and relationships. This discovered knowledge is then used to generate reliable, easy-to-grasp predictive models for applications such as finding best prospects and customers, targeted marketing, detecting credit card fraud, and managing credit risk.

Designed for both non-technical and technical business users, CART can quickly reveal important data relationships that could remain hidden using other analytical tools. The most recent 2008 release, CART 6.0, includes modeling automation technology that dramatically accelerates the process of generating accurate and robust models for deployment in core business functions. CART was the primary tool used to win the KDDCup 2000 web mining competition and is currently in use in major web applications.

Technically, Classification And Regression Trees (CART) is based on landmark mathematical theory introduced in 1984 by four world-renowned statisticians at Stanford University and the University of California at Berkeley. Salford Systems’ implementation of CART is the only decision tree software embodying the proprietary code written by CART co-author Professor Jerome H. Friedman.

The CART creators continue to collaborate with Salford Systems to enhance CART with proprietary advances. With CART 6.0 ProEX, Salford has introduced patented extensions to CART specifically designed to enhance results for market research and web analytics. CART supports high-speed deployment, allowing Salford models to predict and score in real time on a massive scale.

CART 6.0 ProEX, newly released for 2008, comes with a huge list of new features that will help analysts work more rapidly and guide their models to the best-performing trees. This is a dramatic upgrade of our flagship product and is drawing rave reviews from our customers. All of the new CART 6.0 ProEX features are explained in detail in our feature matrix (PDF); some highlights are listed below:

Tree Controls

  • Force splitters into nodes
  • Confine select splitters to specific regions of a tree (Structured Tree™)

HotSpot Detector™

  • Search data for ultra-high performance segments.
  • HotspotDetector trees are specifically designed to yield extraordinarily high-lift or high-risk nodes. The process focuses on individual nodes and generally discards the remainder of the tree.

Train/Test Consistency Assessment

  • Node-by-node summaries of agreement between train and test data on both class assignment and rank ordering of the nodes.
  • Quickly identify ideally-performing robust trees.

Modeling Automation

  • Automatically generate entire collections of trees exploring different control parameters.
  • Nineteen automated batteries cover exploration of multiple splitting rules, five alternative missing value handling strategies, random selection of alternative predictor lists, progressively smaller (or larger) training sample sizes, and much more.

Predictor Refinement

  • Includes stepwise backwards predictor elimination using any of three predictor ranking criteria (lowest variable importance rank, lowest loss of area under the ROC curve, highest variable importance rank).

Model Assessment via Monte Carlo Testing

  • Measure possible overfitting with automated Monte Carlo randomization tests.

Constructed Features

  • New tools for automatic construction of new features (as linear combinations of predictors).
  • Identification of multiple lists of candidates allows precise control over which predictors may be combined into a single new feature.

Unsupervised Learning Mode

  • Use Breiman's column scrambler to automatically detect potential clusters with no need to scale data, address missing values, or select variables for clustering.