IBM 15 - Manuals
IBM 15 – Manual in PDF format online.
Manuals:
Manual IBM 15
Summary
Preface IBM® SPSS® Modeler is the IBM Corp. enterprise-strength data mining workbench. SPSSModeler helps organizations to improve customer and citizen relationships through an in-depthunderstanding of data. Organizations use the insight gained from SPSS Modeler to retainpro fi table customers, ident...
Contents 1 About IBM SPSS Modeler 1 IBM SPSS Modeler Products . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 IBM SPSS Modeler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 I...
Logical Functions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 Numeric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138 Trigonometric Functions . . . . ....
Chapter 1 1 1 1 About IBM SPSS Modeler IBM® SPSS® Modeler is a set of data mining tools that enable you to quickly develop predictivemodels using business expertise and deploy them into business operations to improve decisionmaking. Designed around the industry-standard CRISP-DM model, SPSS Modeler ...
2 Chapter 1 IBM SPSS Modeler Server SPSS Modeler uses a client/server architecture to distribute requests for resource-intensiveoperations to powerful server software, resulting in faster performance on larger data sets. SPSS Modeler Server is a separately-licensed product that runs continually in d...
3 About IBM SPSS Modeler can be shared by multiple users, or accessed from the thin-client application IBM SPSS ModelerAdvantage. You install the adapter on the system that hosts the repository. IBM SPSS Modeler Editions SPSS Modeler is available in the following editions. SPSS Modeler Professional ...
4 Chapter 1 IBM SPSS Modeler Documentation Documentation in online help format is available from the Help menu of SPSS Modeler. Thisincludes documentation for SPSS Modeler, SPSS Modeler Server, and SPSS Modeler SolutionPublisher, as well as the Applications Guide and other supporting materials. Comp...
5 About IBM SPSS Modeler IBM SPSS Modeler Administration Console User Guide. Information on installing and using the console user interface for monitoring and con fi guring SPSS Modeler Server. The console is implemented as a plug-in to the Deployment Manager application. IBM SPSS Modeler Solution P...
6 Chapter 1 Demos Folder The data fi les and sample streams used with the application examples are installed in the Demos folder under the product installation directory. This folder can also be accessed from the IBM SPSS Modeler 15 program group on the Windows Start menu, or by clicking Demos on th...
Chapter 2 2 2 2 New Features New and Changed Features in IBM SPSS Modeler 15 From this release onwards, IBM® SPSS® Modeler has the following editions. IBM® SPSS® Modeler Professional is the new name for the existing SPSS Modeler product. IBM® SPSS® Modeler Premium is a separately-licensed product th...
8 Chapter 2 Default settings for database connections. You can now specify default settings for SQL Server and Oracle database connections, as well as those already supported for IBM DB2 InfoSphereWarehouse. Stream properties and optimization redesign. The Options tab on the Stream Properties dialog...
9 New Features SQL generation enhancements. The Aggregate node now supports SQL generation for date, time, timestamp, and string data types, in addition to integer and real. With IBM Netezza databases, theSample node supports SQL generation for simple and complex sampling, and the Binning nodesuppor...
10 Chapter 2 New features in IBM SPSS Modeler Premium IBM® SPSS® Modeler Premium is a separately-licensed product that provides additional featuresto those supplied by IBM® SPSS® Modeler Professional. Previously, SPSS Modeler Premiumincluded only IBM® SPSS® Modeler Text Analytics . The full set of S...
11 New Features The Netezza Time Series node analyzes time series data and can predict futurebehavior from past events. The Netezza Generalized Linear model expands the linear regression model so thatthe dependent variable is related to the predictor variables by means of a speci fi ed link function...
Chapter 3 3 3 3 IBM SPSS Modeler Overview Getting Started As a data mining application, IBM® SPSS® Modeler offers a strategic approach to fi nding useful relationships in large data sets. In contrast to more traditional statistical methods, you do notnecessarily need to know what you are looking for...
13 IBM SPSS Modeler Overview Launching from the Command Line You can use the command line of your operating system to launch IBM® SPSS® Modeleras follows: E On a computer where IBM® SPSS® Modeler is installed, open a DOS, or command-prompt, window. E To launch the SPSS Modeler interface in interacti...
14 Chapter 3 To Connect to a Server E On the Tools menu, click Server Login . The Server Login dialog box opens. Alternatively, double-click the connection status area of the SPSS Modeler window. E Using the dialog box, specify options to connect to the local server computer or select a connection f...
15 IBM SPSS Modeler Overview Note : You cannot edit a server connection that was added from IBM® SPSS® Collaboration and Deployment Services, since the name, port, and other details are de fi ned in IBM SPSS Collaboration and Deployment Services. Figure 3-3 Server Login Add/Edit Server dialog box To...
16 Chapter 3 Searching for Servers in IBM SPSS Collaboration and Deployment Services Instead of entering a server connection manually, you can select a server or server cluster availableon the network through the Coordinator of Processes, available in IBM® SPSS® Collaborationand Deployment Services....
17 IBM SPSS Modeler Overview E Edit options.cfg , located in the /con fi g directory of your SPSS Modeler installation directory. Edit the temp_directory parameter in this fi le to read: temp_directory, "C:/spss/servertemp" . E After doing this, you must restart the SPSS Modeler Server servi...
18 Chapter 3 This sequence of operations is known as a data stream because the data fl ows record by record from the source through each manipulation and, fi nally, to the destination—either a model or type of data output. Figure 3-5 A simple stream IBM SPSS Modeler Stream Canvas The stream canvas i...
19 IBM SPSS Modeler Overview Field Ops. Nodes perform operations on data fi elds , such as fi ltering, deriving new fi elds, and determining the measurement level for given fi elds. Graphs. Nodes graphically display data before and after modeling. Graphs include plots, histograms, web nodes, and eva...
20 Chapter 3 Figure 3-8 Outputs tab The Models tab is the most powerful of the manager tabs. This tab contains all model nuggets , which contain the models generated in SPSS Modeler, for the current session. These models canbe browsed directly from the Models tab or added to the stream in the canvas...
21 IBM SPSS Modeler Overview Figure 3-10 CRISP-DM view The Classes tab provides a way to organize your work in SPSS Modeler categorically—by thetypes of objects you create. This view is useful when taking inventory of data, streams, andmodels. Figure 3-11 Classes view IBM SPSS Modeler Toolbar At the...
22 Chapter 3 Cut & move to clipboard Copy to clipboard Paste selection Undo last action Redo Search for nodes Edit stream properties Preview SQL generation Run current stream Run stream selection Stop stream (Active only whilestream is running) Add SuperNode Zoom in (SuperNodes only) Zoom out (S...
23 IBM SPSS Modeler Overview Customizing the Toolbar You can change various aspects of the toolbar, such as: Whether it is displayed Whether the icons have tooltips available Whether it uses large or small icons To turn the toolbar display on and off: E On the main menu, click: View > Toolbar >...
24 Chapter 3 Figure 3-12 Maximized stream canvas As an alternative to closing the nodes palette, and the managers and project panes, you can use thestream canvas as a scrollable page by moving vertically and horizontally with the scrollbars at theside and bottom of the SPSS Modeler window. You can a...
25 IBM SPSS Modeler Overview Figure 3-13 Changing the icon size To scale the entire stream (stream properties method) E From the main menu, choose Tools > Stream Properties > Options > Layout. E Choose the size you want from the Icon Size menu. E Click Apply to see the result. E Click OK to...
26 Chapter 3 Using the Mouse in IBM SPSS Modeler The most common uses of the mouse in IBM® SPSS® Modeler include the following: Single-click. Use either the right or left mouse button to select options from menus, open pop-up menus, and access various other standard controls and options. Click and h...
27 IBM SPSS Modeler Overview Table 3-2 Supported shortcuts for old hot keys ShortcutKey Function Ctrl+Alt+D Duplicate node Ctrl+Alt+L Load node Ctrl+Alt+R Rename node Ctrl+Alt+U Create User Input node Ctrl+Alt+C Toggle cache on/off Ctrl+Alt+F Flush cache Ctrl+Alt+X Expand SuperNode Ctrl+Alt+Z Zoom i...
28 Chapter 3 revenue data or as complex as transforming web log data into a set of fi elds and records with usable information. For more information, see the topic About CLEM in Chapter 7 on p. 105. Scripting is a powerful tool for automating processes in the user interface. Scripts can perform the ...
Chapter 4 4 4 4 Understanding Data Mining Data Mining Overview Through a variety of techniques, data mining identi fi es nuggets of information in bodies of data. Data mining extracts information in such a way that it can be used in areas such as decisionsupport, prediction, forecasts, and estimatio...
30 Chapter 4 Typically, you will use these facilities to identify a promising set of attributes in the data. Theseattributes can then be fed to the modeling techniques, which will attempt to identify underlyingrules and relationships. Typical Applications Typical applications of data mining techniqu...
31 Understanding Data Mining together. It may not even be online. If it exists only on paper, data entry will be required beforeyou can begin data mining. Check whether the data covers the relevant attributes The object of data mining is to identify relevant attributes, so including this check may s...
32 Chapter 4 A Strategy for Data Mining As with most business endeavors, data mining is much more effective if done in a planned,systematic way. Even with cutting-edge data mining tools, such as IBM® SPSS® Modeler, themajority of the work in data mining requires a knowledgeable business analyst to k...
33 Understanding Data Mining Figure 4-1 CRISP-DM process model The six phases include: Business understanding. This is perhaps the most important phase of data mining. Business understanding includes determining business objectives, assessing the situation, determiningdata mining goals, and producin...
34 Chapter 4 have been resolved adequately. Similarly, the evaluation phase can lead you to reevaluate youroriginal business understanding, and you may decide that you have been trying to answer thewrong question. At this point, you can revise your business understanding and proceed throughthe rest ...
37 Understanding Data Mining The Self-Learning Response Model (SLRM) node enables you to build a model inwhich a single new case, or small number of new cases, can be used to reestimate themodel without having to retrain the model using all data. The Time Series node estimates exponential smoothing,...
38 Chapter 4 preconditions. Apriori requires that input and output fi elds all be categorical but delivers better performance because it is optimized for this type of data. The CARMA model extracts a set of rules from the data without requiring you tospecify input or target fi elds. In contrast to A...
39 Understanding Data Mining Segmentation nodes The Auto Cluster node estimates and compares clustering models, which identifygroups of records that have similar characteristics. The node works in the samemanner as other automated modeling nodes, allowing you to experiment with multiplecombinations ...
40 Chapter 4 Data Mining Examples The best way to learn about data mining in practice is to start with an example. A number ofapplication examples are available in the IBM® SPSS® Modeler Applications Guide , which provides brief, targeted introductions to speci fi c modeling methods and techniques. ...
Chapter 5 5 5 5 Building Streams Stream-Building Overview Data mining using IBM® SPSS® Modeler focuses on the process of running data through a seriesof nodes, referred to as a stream. This series of nodes represents operations to be performed on the data, while links between the nodes indicate the ...
42 Chapter 5 Figure 5-1 Completed stream on the stream canvas This section contains more detailed information on working with nodes to create more complexdata streams. It also discusses options and settings for nodes and streams. For step-by-stepexamples of stream building using the data shipped wit...
43 Building Streams A runnable node that processes stream data is known as a terminal node . A modeling or output node is a terminal node if it is located at the end of a stream or stream branch. You cannotconnect further nodes to a terminal node. Note : You can customize the Nodes palette. For more...
44 Chapter 5 Figure 5-2 Stream created by double-clicking nodes from the palettes To Connect Nodes Using the Middle Mouse Button On the stream canvas, you can click and drag from one node to another using the middle mousebutton. (If your mouse does not have a middle button, you can simulate this by ...
45 Building Streams Figure 5-5 Connected nodes When connecting nodes, there are several guidelines to follow. You will receive an error messageif you attempt to make any of the following types of connections: A connection leading to a source node A connection leading from a terminal node A node havi...
46 Chapter 5 Disabling Nodes in a Stream Process nodes with a single input within streams can be disabled, with the result that the node isignored during running of the stream. This saves you from having to remove or bypass the nodeand means you can leave it connected to the remaining nodes. You can...
47 Building Streams Figure 5-8 Connecting a new node between two connected nodes E With the middle mouse button, click and drag the connection arrow into which you want to insert the node. Alternatively, you can hold down the Alt key while clicking and dragging to simulate amiddle mouse button. Figu...
48 Chapter 5 Figure 5-10 Deleting the connection between nodes in a stream To delete all connections to and from a node, do one of the following: Select the node and press F3. Select the node, and on the main menu click: Edit > Node > Disconnect Setting Options for Nodes Once you have created ...
49 Building Streams Figure 5-11 Pop-up menu options for nodes Click Edit to open the dialog box for the selected node. Click Connect to manually connect one node to another. Click Disconnect to delete all links to and from the node. Click Rename and Annotate to open the Annotations tab of the editin...
50 Chapter 5 Click Save Node to save the node’s details in a fi le. You can load node details only into another node of the same type. Click Store Node to store the selected node in a connected IBM SPSS Collaboration and Deployment Services Repository. Click Cache to expand the menu, with options fo...
51 Building Streams Figure 5-12 Caching at the Type node to store newly derived fields To Enable a Cache E On the stream canvas, right-click the node and click Cache on the menu. E On the caching submenu, click Enable . E You can turn the cache off by right-clicking the node and clicking Disable on ...
52 Chapter 5 Note : The following databases support temporary tables for the purpose of caching: DB2, Netezza, Oracle, SQL Server, and Teradata. Other databases will use a normal table for databasecaching. The SQL code can be customized for speci fi c databases - contact Support for assistance. To F...
53 Building Streams Figure 5-13 Data Preview from a model nugget From the Generate menu, you can create several types of nodes. Locking Nodes To prevent other users from amending the settings of one or more nodes in a stream, you canencapsulate the node or nodes in a special type of node called a Su...
54 Chapter 5 Figure 5-14 Streams tab in the managers pane with pop-up menu options From this tab, you can: Access streams. Save streams. Save streams to the current project. Close streams. Open new streams. Store and retrieve streams from an IBM SPSS Collaboration and Deployment Services repository ...
55 Building Streams Logging and status. Options controlling SQL logging and record status. For more information, see the topic Setting SQL logging and record status options for streams on p. 63. Layout. Options relating to the layout of the stream on the canvas. For more information, see the topic S...
57 Building Streams Maximum number of rows to show in Data Preview. Specify the number of rows to be shown when a preview of the data is requested for a node. For more information, see the topic Previewing Data in Nodes on p. 52. Maximum members for nominal fields. Select to specify a maximum number...
59 Building Streams 2-digit dates start from. Specify the cutoff year to add century digits for years denoted with only two digits. For example, specifying 1930 as the cutoff year will assume that 05/11/02 is in theyear 2002. The same setting will use the 20th century for dates after 30; thus 05/11/...
60 Chapter 5 Decimal places (standard, scientific, currency). For number display formats, speci fi es the number of decimal places to be used when displaying or printing real numbers. This option is speci fi ed separately for each display format. Calculations in. Select Radians or Degrees as the uni...
61 Building Streams Figure 5-19 Setting stream optimization options Note: Whether SQL pushback and optimization are supported depends on the type of database in use. For the latest information on which databases and ODBC drivers are supported and tested foruse with IBM® SPSS® Modeler 15, see the cor...
62 Chapter 5 reduce network traf fi c and speed stream operations. Note that the Generate SQL check box must be selected for SQL optimization to have any effect. Optimize syntax execution. This method of stream rewriting increases the ef fi ciency of operations that incorporate more than one node co...
63 Building Streams Setting SQL logging and record status options for streams These settings include various options controlling the display of SQL statements generated by thestream, and the display of the number of records processed by the stream. Figure 5-20 Setting SQL logging and record status o...
64 Chapter 5 Reformat SQL for improved readability. Speci fi es whether SQL displayed in the log should be formatted for readability. Show status for records. Speci fi es when records should be reported as they arrive at terminal nodes. Specify a number that is used for updating the status every N r...
65 Building Streams Stream scroll rate. Specify the scrolling rate for the stream canvas to control how quickly the stream canvas pane scrolls when a node is being dragged from one place to another on the canvas.Higher numbers specify a faster scroll rate. Icon name maximum. Specify a limit in chara...
67 Building Streams Figure 5-23 Stream running with error reported If SQL optimization and logging options are enabled in the User Options dialog box, theninformation on generated SQL is also displayed. For more information, see the topic Setting optimization options for streams on p. 60. You can sa...
68 Chapter 5 Figure 5-24 Viewing execution times for nodes in the stream In the table of node execution times, the columns are as follows. Click a column heading to sortthe entries into ascending or descending order (for example, to see which nodes have the longestexecution times). Terminal Node. Th...
69 Building Streams Parameters can also be set for SuperNodes, in which case they are visible only to nodesencapsulated within that SuperNode. To Set Stream and Session Parameters through the User Interface E To set stream parameters, on the main menu, click: Tools > Stream Properties > Parame...
70 Chapter 5 Note that long name, storage, and type options can be set for parameters through the user interface only. These options cannot be set using scripts. Click the arrows at the right to move the selected parameter further up or down the list of available parameters. Use the delete button (m...
71 Building Streams Figure 5-27 Specifying available values for a parameter Type. Displays the currently selected measurement level. You can change this value to re fl ect the way that you intend to use the parameter in IBM® SPSS® Modeler. Storage. Displays the storage type if known. Storage types a...
72 Chapter 5 Analytical Decision Management or Predictive Applications 5.x. All streams require a designatedscoring branch before they can be deployed; additional requirements and options depend on thedeployment type. For more information, see the topic Storing and Deploying Repository Objects in Ch...
73 Building Streams Searching for Nodes in a Stream You can search for nodes in a stream by specifying a number of search criteria, such as node name,category and identi fi er. This feature can be especially useful for complex streams containing a large number of nodes. To Search for Nodes in a Stre...
74 Chapter 5 Node category. Check this box and click a category on the list to search for a particular type of node. Process Node means a node from the Record Ops or Field Ops tab of the nodes palette; Apply Model Node refers to a model nugget. Keywords include. Check this box and enter one or more ...
75 Building Streams Figure 5-30 Opening section of stream description The stream description is displayed in the form of an HTML document consisting of a numberof sections. General Stream Information This section contains the stream name, together with details of when the stream was createdand last ...
76 Chapter 5 Inputs. Lists the input fi elds together with their storage types (for example, string, integer, real and so on). Outputs. Lists the output fi elds, including the additional fi elds generated by the modeling node, together with their storage types. Parameters. Lists any parameters relat...
77 Building Streams Exporting Stream Descriptions You can export the contents of the stream description to an HTML fi le. To export a stream description: E On the main menu, click: File > Export Stream Description E Enter a name for the HTML fi le and click Save . Running Streams Once you have sp...
78 Chapter 5 Some nodes have further displays giving additional information about stream execution. Theseare displayed by selecting the corresponding row in the dialog box. The fi rst row is selected automatically. Working with Models If a stream includes a modeling node (that is, one from the Model...
79 Building Streams Figure 5-34 Stream with comments added Others can then view these comments on-screen, or you can print out an image of the streamthat includes the comments. You can list all the comments for a stream or SuperNode, change the order of comments in the list, edit the comment text, a...
80 Chapter 5 The appearance of the text box changes to indicate the current mode of the comment (or annotationshown as a comment), as the following table shows. Table 5-1 Comment and annotation text box modes Comment text box Annotation text box Mode Indicates Obtained by... Edit Comment is open for...
81 Building Streams Figure 5-37 Comment in edit mode When you click again, the border changes to solid lines to show that editing is complete. Figure 5-38 Completed comment Double-clicking a comment changes the text box to edit mode—the background changes to white and the comment text can be edited....
82 Chapter 5 Right-click the stream background and click New Comment on the pop-up menu. Click the New Comment button in the toolbar. E Enter the comment text (or paste in text from the clipboard). E Click a node in the stream to save the comment. To attach a comment to a node or nugget E Select one...
83 Building Streams E Edit the comment text. You can use standard Windows shortcut keys when editing, for example Ctrl+C to copy text. Other options during editing are listed in the pop-up menu for the comment. E Click outside the text box once to display the resizing controls, then again to complet...
84 Chapter 5 If the comment was originally a stream or SuperNode annotation that had been converted to afreestanding comment, the comment is deleted from the canvas but its text is retained on theAnnotations tab for the stream or SuperNode. To show or hide comments for a stream E Do one of the follo...
85 Building Streams Figure 5-39 Listing comments for a stream Text. The text of the comment. Double-click the text to change the fi eld to an editable text box. Links. The name of the node to which the comment is attached. If this fi eld is empty, the comment applies to the stream. Positioning butto...
86 Chapter 5 E Click the Annotations tab. E Select the Show annotation as comment check box. E Click OK . To convert a SuperNode annotation to a comment E Double-click the SuperNode icon on the canvas. E Click the Annotations tab. E Select the Show annotation as comment check box. E Click OK . Annot...
87 Building Streams Figure 5-40 Annotations tab options Name. Select Custom to adjust the autogenerated name or to create a unique name for the node as displayed on the stream canvas. Tooltip text. (For nodes and model nuggets only) Enter text used as a tooltip on the stream canvas. This is particul...
88 Chapter 5 Show annotation as comment. (For stream and SuperNode annotations only) Check this box to convert the annotation to a freestanding comment that will be visible on the stream canvas.For more information, see the topic Adding Comments and Annotations to Nodes and Streams on p. 78. ID. Dis...
89 Building Streams Saving Multiple Stream Objects When you exit IBM® SPSS® Modeler with multiple unsaved objects, such as streams, projects,or model nuggets, you will be prompted to save before completely closing the software. If youchoose to save items, a dialog box will open with options for savi...
90 Chapter 5 Encrypting and Decrypting Information When you save a stream, node, project, output fi le, or model nugget, you can encrypt it to prevent its unauthorized use. To do this, you select an extra option when saving, and add a password to theitem being saved. This encryption can be set for a...
91 Building Streams Models palette (. gen ) Nodes (. nod ) Output (. cou ) Projects (. cpj ) Opening New Files Streams can be loaded directly from the File menu. E On the File menu, click Open Stream . All other fi le types can be opened using the submenu items available on the File menu. For exampl...
92 Chapter 5 Map to. This method starts with the node to be introduced to the stream. First, right-click the node to introduce; then, using the Data Mapping > Map To option from the pop-up menu, select the node to which it should join. This method is particularly useful for mapping to a terminal ...
93 Building Streams Step 3: Replace the template source node. Using the Data Mapping option on the pop-up menu for the template source node, click Select Replacement Node , then select the source node for the replacement data. Figure 5-45 Selecting a replacement source node Step 4: Check mapped fiel...
94 Chapter 5 Mapping between Streams Similar to connecting nodes, this method of data mapping does not require you to set essential fi elds beforehand. With this method, you simply connect from one stream to another using Map to from the Data Mapping pop-up menu. This type of data mapping is useful ...
95 Building Streams To Set Essential Fields E Right-click the source node of the template stream that will be replaced. E On the menu, click: Data Mapping > Specify Essential Fields Figure 5-48 Specifying essential fields E Using the Field Chooser, you can add or remove fi elds from the list. To ...
96 Chapter 5 Mapped. Lists the fi elds selected for mapping to template fi elds. These are the fi elds whose names may have to change to match the original fi elds used in stream operations. Click in the table cell for a fi eld to activate a list of available fi elds. If you are unsure of which fi e...
97 Building Streams Figure 5-51 ToolTip and custom node name Insert values automatically into a CLEM expression. Using the Expression Builder, accessible from a variety of dialog boxes (such as those for Derive and Filler nodes), you canautomatically insert fi eld values into a CLEM expression. Clic...
98 Chapter 5 Figure 5-53 Selecting the Demos folder from the list of recently-used directories Minimize output window clutter. You can close and delete output quickly using the red X button at the top right corner of all output windows. This enables you to keep only promising orinteresting results o...
Chapter 6 6 6 6 Handling Missing Values Overview of Missing Values During the Data Preparation phase of data mining, you will often want to replace missing valuesin the data. Missing values are values in the data set that are unknown, uncollected, or incorrectly entered. Usually, such values are inv...
100 Chapter 6 Figure 6-1 Specifying missing values for a continuous variable Reading in mixed data. Note that when you are reading in fi elds with numeric storage (either integer, real, time, timestamp, or date), any non-numeric values are set to null or system missing . This is because, unlike some...
101 Handling Missing Values In general terms, there are two approaches you can follow: You can exclude fi elds or records with missing values You can impute, replace, or coerce missing values using a variety of methods Both of these approaches can be largely automated using the Data Audit node. For ...
102 Chapter 6 Screening or Removing Fields To screen out fi elds with too many missing values, you have several options: You can use a Data Audit node to fi lter fi elds based on quality. You can use a Feature Selection node to screen out fi elds with more than a speci fi ed percentage of missing va...
104 Chapter 6 Note on Discarding Records When using a Select node to discard records, note that syntax uses three-valued logic andautomatically includes null values in select statements. To exclude null values (system-missing)in a select expression, you must explicitly specify this by using and not ...
Chapter 7 7 7 7 Building CLEM Expressions About CLEM The Control Language for Expression Manipulation (CLEM) is a powerful language for analyzingand manipulating the data that fl ows along IBM® SPSS® Modeler streams. Data miners use CLEM extensively in stream operations to perform tasks as simple as...
106 Chapter 7 Figure 7-1 Derive node creating a new field based on a formula CLEM expressions can also be used for global search and replace operations. For example, theexpression @NULL(@FIELD) can be used in a Filler node to replace system-missing values with the integer value 0. (To replace user-m...
108 Chapter 7 Figure 7-3 Conditional Derive comparing values of one field to those of the field before it CLEM Examples To illustrate correct syntax as well as the types of expressions possible with CLEM, exampleexpressions follow. Simple Expressions Formulas can be as simple as this one, which deri...
109 Building CLEM Expressions Complex Expressions Expressions can also be lengthy and more complex. The following expression returns true if the value of two fi elds ( $KX-Kohonen and $KY-Kohonen ) fall within the speci fi ed ranges. Notice that here the fi eld names are single-quoted because the fi...
110 Chapter 7 Frequently, special functions are used in combination, which is a commonly used method of fl agging blanks in more than one fi eld at a time. @BLANK(@FIELD)-> T Additional examples are discussed throughout the CLEM documentation. For more information,see the topic CLEM Reference Ove...
111 Building CLEM Expressions Characters —Always use single backquotes like this ` . For example, note the character d in the function stripchar(`d`,"drugA") . The only exception to this is when you are using an integer to refer to a speci fi c character in a string. For example, note the ch...
112 Chapter 7 If you want to override precedence, or if you are in any doubt of the order of evaluation, you canuse parentheses to make it explicit—for example, sqrt(abs(Signal)) * (max(T1, T2) + Baseline) Stream, Session, and SuperNode Parameters Parameters can be de fi ned for use in CLEM expressi...
113 Building CLEM Expressions Determining the length (number of characters) for a string variable— length(STRING). Checking the alphabetical ordering of string values— alphabefore(STRING1, STRING2) . Removing leading or trailing white space from values— trim(STRING) , trim_start(STRING) , or trimend...
114 Chapter 7 Figure 7-4 Filler node replacing system-missing values with 0 For more information, see the topic Functions Handling Blanks and Null Values in Chapter 8 on p. 156. Working with Numbers Numerous standard operations on numeric values are available in IBM® SPSS® Modeler, such as: Calculat...
115 Building CLEM Expressions Calculating Time Passed You can easily calculate the time passed from a baseline date using a family of functions similar tothe following one. This function returns the time in months from the baseline date to the daterepresented by the date string DATE as a real number...
116 Chapter 7 You can also use a number of counting functions to obtain counts of values that meet speci fi c criteria, even when those values are stored in multiple fi elds. For example, to count the number of cards that have been held for more than fi ve years: count_greater_than(5, ['cardtenure' ...
117 Building CLEM Expressions Working with Multiple-Response Data A number of comparison functions can be used to analyze multiple-response data, including: value_at first_index / last_index first_non_null / last_non_null first_non_null_index / last_non_null_index min_index / max_index For example, ...
118 Chapter 7 functions. In addition, the Builder controls automatically add the proper quotes for fi elds and values, making it easier to create syntactically correct expressions. Figure 7-5 Expression Builder dialog box Note : The Expression Builder is not supported in scripting or parameter setti...
119 Building CLEM Expressions Accessing the Expression Builder The Expression Builder is available in all nodes where CLEM expressions are used, includingSelect, Balance, Derive, Filler, Analysis, Report, and Table nodes. You can open it by clicking thecalculator button just to the right of the form...
120 Chapter 7 E Double-click or click the yellow arrow button to add the fi eld or function to the expression fi eld. E Use the operand buttons in the center of the dialog box to insert the operations into the expression. Selecting Functions The function list displays all available CLEM functions an...
122 Chapter 7 Viewing or Selecting Values Field values can be viewed from a number of places in the system, including the ExpressionBuilder, data audit reports, and when editing future values in a Time Intervals node. Note thatdata must be fully instantiated in a source or Type node to use this feat...
123 Building CLEM Expressions Checking CLEM Expressions Click Check in the Expression Builder (lower right corner) to validate the expression. Expressions that have not been checked are displayed in red. If errors are found, a message indicating thecause is displayed. Figure 7-11 Invalid CLEM expres...
124 Chapter 7 Figure 7-12 Find/Replace dialog box E With the cursor in a text area, press Ctrl+F to access the Find/Replace dialog box. E Enter the text you want to search for, or choose from the drop-down list of recently searched items. E Enter the replacement text, if any. E Click Find Next to st...
125 Building CLEM Expressions Characters Matches \0nn The character with octal value 0nn (0 <= n <= 7) \0mnn The character with octal value 0mnn (0 <= m <= 3, 0 <= n <= 7) \xhh The character with hexadecimal value 0xhh \uhhhh The character with hexadecimal value 0xhhhh \t The tab c...
126 Chapter 7 Boundary matchers Matches \Z The end of the input but for the fi nal terminator, if any \z The end of the input
Chapter 8 8 8 8 CLEM Language Reference CLEM Reference Overview This section describes the Control Language for Expression Manipulation (CLEM), which is apowerful tool used to analyze and manipulate the data used in IBM® SPSS® Modeler streams.You can use CLEM within nodes to perform tasks ranging fr...
128 Chapter 8 For more information, see the topic Values and Data Types in Chapter 7 on p. 110. Additionally, these rules are covered in more detail in the following topics. Integers Integers are represented as a sequence of decimal digits. Optionally, you can place a minus sign(−) before the intege...
129 CLEM Language Reference Strings Generally, you should enclose strings in double quotation marks. Examples of strings are "c35product2" and "referrerID" . To indicate special characters in a string, use a backslash––for example, "\$65443" . (To indicate a backslash charact...
130 Chapter 8 Format Examples DD/MM/YYYY 15/01/1963 MM/DD/YY 01/15/63 MM/DD/YYYY 01/15/1963 DD-MM-YY 15-01-63 DD-MM-YYYY 15-01-1963 MM-DD-YY 01-15-63 MM-DD-YYYY 01-15-1963 DD.MM.YY 15.01.63 DD.MM.YYYY 15.01.1963 MM.DD.YY 01.15.63 MM.DD.YYYY 01.15.1963 DD-MON-YY 15-JAN-63, 15-jan-63, 15-Jan-63 DD/MON...
131 CLEM Language Reference Format Examples MM.SS 55.58, 01.00 (H)H.(M)M.(S)S 12.1.12, 1.1.1, 22.12.12 (H)H.(M)M 12.23, 7.45, 22.7 (M)M.(S)S 55.58, 1.0 CLEM Operators The following operators are available. Operation Comments Precedence (see next section) or Used between two CLEM expressions.Returns ...
132 Chapter 8 Operation Comments Precedence (see next section) && Used between two integers. The resultis the bitwise ‘and’ of the integers INT1and INT2. 4 &&~~ Used between two integers. The resultis the bitwise ‘and’ of INT1 and thebitwise complement of INT2. 4 || Used between two ...
133 CLEM Language Reference Functions Reference The following CLEM functions are available for working with data in IBM® SPSS® Modeler.You can enter these functions as code in a variety of dialog boxes, such as Derive and Set ToFlag nodes, or you can use the Expression Builder to create valid CLEM e...
134 Chapter 8 Convention Description INT , INT1 , INT2 Any integer, such as 1 or –77 . CHAR A character code, such as `A` . STRING A string, such as "referrerID" . LIST A list of items, such as ["abc" "def"] . ITEM A fi eld, such as Customer or extract_concept . DATE A date f...
135 CLEM Language Reference Conversion Functions Conversion functions allow you to construct new fi elds and convert the storage type of existing fi les. For example, you can form new strings by joining strings together or by taking strings apart. To join two strings, use the operator >< . For...
137 CLEM Language Reference Function Result Description max_n(LIST) Number Returns the maximum value from a list of numeric fi elds or null if all of the fi eld values are null. For more information, see the topic Summarizing Multiple Fields in Chapter 7 on p. 115. member(ITEM, LIST) Boolean Returns...
138 Chapter 8 Numeric Functions CLEM contains a number of commonly used numeric functions. Function Result Description – NUM Number Used to negate NUM . Returns the corresponding number with the opposite sign. NUM1 + NUM2 Number Returns the sum of NUM1 and NUM2 . code – NUM2 Number Returns the value...
139 CLEM Language Reference Function Result Description mean_n(LIST) Number Returns the mean value from a list of numeric fi elds or null if all of the fi eld values are null. sdev_n(LIST) Number Returns the standard deviation from a list of numeric fi elds or null if all of the fi eld values are nu...
140 Chapter 8 Function Result Description cdf_normal(NUM, MEAN,STDDEV) Real Returns the probability that a value from the normaldistribution with the speci fi ed mean and standard deviation will be less than the speci fi ed number. cdf_t(NUM, DF) Real Returns the probability that a value from Studen...
141 CLEM Language Reference Function Result Description integer_bitcount(INT) Integer Counts the number of 1 or 0 bits in thetwo’s-complement representation of INT . If INT is non-negative, N is the number of 1 bits. If INT is negative, it is the number of 0 bits. Owing to the sign extension, there ...
145 CLEM Language Reference Function Result Description stripchar(CHAR,STRING) String Enables you to remove speci fi ed characters from a string or fi eld. You can use this function, for example, to remove extrasymbols, such as currency notations, fromdata to achieve a simple number or name.For exam...
146 Chapter 8 Function Result Description unicode_char(NUM) CHAR Returns the character with Unicode value NUM. unicode_value(CHAR) NUM Returns the Unicode value of CHAR uppertolower(CHAR)uppertolower (STRING) CHAR or String Input can be either a string or character and isused in this function to ret...
150 Chapter 8 Function Result Description time_mins_difference(TIME1,TIME2) Real Returns the time difference in minutes between thetimes or timestamps represented by TIME1 and TIME2 , as a real number. If you select Rollover days/mins in the stream properties dialog box, a higher value of TIME1 is t...
151 CLEM Language Reference Sequence functions Record indexing Averaging, summing, and comparing values Monitoring change—differentiation @SINCE Offset values Additional sequence facilities For many applications, each record passing through a stream can be considered as an individualcase, independen...
152 Chapter 8 For this reason, @SINCE does not evaluate its condition for the current record. Use a similar function, @SINCE0 , if you want to evaluate the condition for the current record as well as previous ones; if the condition is true in the current record, @SINCE0 returns 0 . Note : @ function...
155 CLEM Language Reference Function Result Description @SUM(FIELD, EXPR) Number Returns the sum of values for FIELD over the last EXPR records received by the current node, including the current record. FIELD must be the name of a numeric fi eld. EXPR may be any expression evaluating to an integer ...
157 CLEM Language Reference Special Fields Special functions are used to denote the speci fi c fi elds under examination, or to generate a list of fi elds as input. For example, when deriving multiple fi elds at once, you should use @FIELD to denote “perform this derive action on the selected fi eld...
159 Using IBM SPSS Modeler with a Repository Figure 9-1 Objects in the IBM SPSS Collaboration and Deployment Services Repository Extensive Versioning and Search Support The repository provides comprehensive object versioning and search capabilities. For example,suppose that you create a stream and s...
160 Chapter 9 For more information, see the topic Connecting to the Repository on p. 161. Storing and Deploying Repository Objects Streams created in IBM® SPSS® Modeler can be stored in the repository just as they are, as fi les with the extension .str . In this way, a single stream can be accessed ...
161 Using IBM SPSS Modeler with a Repository Other Deployment Options While IBM SPSS Collaboration and Deployment Services offers the most extensive features formanaging enterprise content, a number of other mechanisms for deploying or exporting streamsare also available, including: Export the strea...
162 Chapter 9 Ensure secure connection. Speci fi es whether a Secure Sockets Layer (SSL) connection should be used. SSL is a commonly used protocol for securing data sent over a network. To use thisfeature, SSL must be enabled on the server hosting the repository. If necessary, contact yourlocal adm...
164 Chapter 9 Storing Objects in the Repository Figure 9-5 Storing a model You can store streams, nodes, models, model palettes, projects, and output objects in therepository, from where they can be accessed by other users and applications. Note : A separate license is required to access an IBM® SPS...
165 Using IBM SPSS Modeler with a Repository Choosing the Location for Storing Objects Figure 9-6 Choosing the location for storing an object Save in. Shows the current folder—the location where the object will be stored. Double-click a folder name in the list to set that folder as the current folde...
166 Chapter 9 Figure 9-7 Adding information about the object Author. The username of the user creating the object in the repository. By default, this shows the username used for the repository connection, but you can change this name here. Version Label. Select a label from the list to indicate the ...
167 Using IBM SPSS Modeler with a Repository Assigning Topics to a Stored Object Topics are a hierarchical classi fi cation system for the content stored in the repository. You can choose from the available topics when storing objects, and users can also search for objects bytopic. The list of avail...
168 Chapter 9 Figure 9-9 Setting security options for an object Principal. The repository username of the user or group who has access rights on this object. Permissions. The access rights that this user or group has for the object. Add. Enables you to add one or more users or groups to the list of ...
169 Using IBM SPSS Modeler with a Repository Adding a User to the Permissions List Figure 9-10 Adding a user to the permissions list for an object Select provider. Choose a security provider for authentication. The repository can be con fi gured to use different security providers; if necessary, con...
170 Chapter 9 Read. By default, a user or group that is not the object owner has only Read access rights to the object. Select the appropriate check boxes to add Write, Delete, and Modify Permissions accessrights for this user or group. Storing Streams You can store a stream as a .str fi le in the r...
171 Using IBM SPSS Modeler with a Repository E Specify connection settings to the repository if necessary. For more information, see the topic Connecting to the Repository on p. 161. For speci fi c port, password, and other connection details, contact your local system administrator. E In the Reposi...
172 Chapter 9 Storing Models and Model Palettes You can store an individual model as a .gm fi le in the repository, from where it can be accessed by other users. You can also store the complete contents of the Models palette as a .gen fi le in the repository. Storing a model E Click the object on th...
173 Using IBM SPSS Modeler with a Repository or File > Projects > Retrieve Project... or File > Outputs > Retrieve Output... E Alternatively, right-click in the managers or project pane and click Retrieve on the pop-up menu. E To retrieve a node, on the SPSS Modeler main menu click: Inse...
174 Chapter 9 Look in. Shows the folder hierarchy for the current folder. To navigate to a different folder, select one from this list to navigate there directly, or navigate using the object list below this fi eld. Up Folder button. Navigates to one level above the current folder in the hierarchy. ...
175 Using IBM SPSS Modeler with a Repository E Select the object version you want to work with. E Click Continue. Searching for Objects in the Repository You can search for objects by name, folder, type, label, date, or other criteria. Searching by Name To search for objects by name: E On the IBM® S...
176 Chapter 9 When searching for objects by name, an asterisk ( * ) can be used as a wildcard character to match any string of characters, and a question mark ( ? ) matches any single character. For example, *cluster* matches all objects that include the string cluster anywhere in the name. The sear...
177 Using IBM SPSS Modeler with a Repository Topics. You can search on models associated with speci fi c topics from a list set by repository users with the appropriate privileges (for more information, see the Deployment Manager User’s Guide ). To obtain the list, check this box, then click the Add...
178 Chapter 9 Figure 9-16 Locked object To lock an object E In the repository explorer window, right-click the required object. E Click Lock . To unlock an object E In the repository explorer window, right-click the required object. E Click Unlock . Deleting Repository Objects Before deleting an obj...
179 Using IBM SPSS Modeler with a Repository Figure 9-17 Select versions to delete Managing Properties of Repository Objects You can control various object properties from SPSS Modeler. You can: View the properties of a folder View and edit the properties of an object Create, apply and delete versio...
180 Chapter 9 Displays the folder name, creation, and modi fi cation dates. Permissions tab Speci fi es read and write permissions for the folder. All users and groups with access to the parent folder are listed. Permissions follow a hierarchy. For example, if you do not have readpermission, you can...
181 Using IBM SPSS Modeler with a Repository E In the repository window, right-click the required object. E Click Object Properties . Figure 9-20 Object properties General Tab Name. The name of the object as viewed in the repository. Created on. Date the object (not the version) was created. Last mo...
182 Chapter 9 Figure 9-21 Version properties The following properties can be speci fi ed or modi fi ed for speci fi c versions of a stored object: Version. Unique identi fi er for the version generated based on the time when the version was stored. Label. Current label for the version, if any. Unlik...
183 Using IBM SPSS Modeler with a Repository Figure 9-22 Object access rights Users And Groups. Lists the repository users and groups that have at least Read access to this object. Select the Write and Delete check boxes to add those access rights for this object to aparticular user or group. Click ...
184 Chapter 9 To define a new label and apply it to the object E Type the label name in the New Label fi eld. E Click the right-arrow button to move the new label to the Applied Labels list. E Click OK . Deploying Streams To enable a stream to be used with the thin-client application IBM® SPSS® Mode...
186 Chapter 9 Figure 9-24 Stream Deployment options Deployment type. Choose how you want to deploy the stream. All streams require a designated scoring node before they can be deployed; additional requirements and options depend on thedeployment type. <none> . The stream will not be deployed t...
187 Using IBM SPSS Modeler with a Repository Scoring node. Select a graph, output or export node to identify the stream branch to be used for scoring the data. While the stream can actually contain any number of valid branches, models, andterminal nodes, one and only one scoring branch must be desig...
188 Chapter 9 Scoring and Modeling Parameters When deploying a stream to IBM SPSS Collaboration and Deployment Services, you can choosewhich parameters can be viewed or edited each time the model is updated or scored. For example,you might specify maximum and minimum values, or some other value that...
189 Using IBM SPSS Modeler with a Repository Figure 9-26 Stream with scoring branch highlighted If the stream already had a scoring branch de fi ned, the newly-designated branch replaces it as the scoring branch. You can set the color of the scoring branch indication by means of a Custom Coloroption...
190 Chapter 9 To designate a branch as the scoring branch (Tools menu) E Connect the model nugget to a terminal node (a processing or output node downstream from the nugget). E On the main menu, click: Tools > Stream Properties > Deployment E On the Deployment type list, click Scoring Only or ...
191 Using IBM SPSS Modeler with a Repository Single Model in Stream Figure 9-28 Scoring branch with single model in the stream If a single linked model nugget is on the scoring branch when it is identi fi ed as such, that nugget becomes the refresh model for the stream. Multiple Models in Stream If ...
194 Chapter 9 No Models in Stream If there are no models in the stream, or only models with no model links, the deployment type isset to Scoring Only. Checking a Scoring Branch for Errors When you designate the scoring branch, it is checked for errors, such as not having an EnterpriseView node in th...
Chapter 1 1 1 10 0 0 0 Exporting to External Applications About Exporting to External Applications IBM® SPSS® Modeler provides a number of mechanisms to export the entire data mining processto external applications, so that the work you do to prepare data and build models can be used toyour advantag...
196 Chapter 10 E Specify connection settings to the repository if necessary. For more information, see the topic Connecting to the Repository in Chapter 9 on p. 161. For speci fi c port, password, and other connection details, contact your local system administrator. Note : The repository server mus...
197 Exporting to External Applications E In the Export (or Save) dialog box, specify a target directory and a unique name for the model. Note : You can change options for PMML export in the User Options dialog box. On the main menu, click: Tools > Options > User Options and click the PMML tab....
198 Chapter 10 Figure 10-3 Selecting the XML file for a model saved using PMML Use variable labels if present in model. The PMML may specify both variable names and variable labels (such as Referrer ID for RefID ) for variables in the data dictionary. Select this option to use variable labels if the...
199 Exporting to External Applications Neural Net C5.0 Logistic Regression Genlin SVM Bayes Net Apriori Carma K-Means Kohonen TwoStep KNN Statistics Model The following model created in SPSS Modeler can be exported as PMML 3.2: Decision List Database native models. For models generated using databas...
Chapter 1 1 1 11 1 1 1 Projects and Reports Introduction to Projects A project is a group of fi les related to a data mining task. Projects include data streams, graphs, generated models, reports, and anything else that you have created in IBM® SPSS® Modeler. At fi rst glance, it may seem that SPSS ...
201 Projects and Reports CRISP-DM View By supporting the Cross-Industry Standard Process for Data Mining (CRISP-DM), IBM® SPSS®Modeler projects provide an industry-proven and non-proprietary way of organizing the piecesof your data mining efforts. CRISP-DM uses six phases to describe the process fro...
202 Chapter 11 Classes View The Classes view in the project pane organizes your work in IBM® SPSS® Modeler categoricallyby the types of objects created. Saved objects can be added to any of the following categories: Streams Nodes Models Tables, graphs, reports Other (non-SPSS Modeler fi les, such as...
203 Projects and Reports E On the main menu, click: File > Project > New Project... Adding to a Project Once you have created or opened a project, you can add objects, such as data streams, nodes,and reports, using several methods. Adding Objects from the Managers Using the managers in the upp...
204 Chapter 11 Adding Nodes from the Canvas You can add individual nodes from the stream canvas by using the Save dialog box. E Select a node on the canvas. E Right-click, and click Save Node . Alternatively, on the main menu click: Edit > Node > Save Node... E In the Save dialog box, select A...
205 Projects and Reports Transferring a Project Make sure that the project you want to transfer is open in the project pane. To transfer a project: E Right-click the root project folder and click Transfer Project . E If prompted, log in to IBM SPSS Collaboration and Deployment Services Repository. E...
206 Chapter 11 Summary. You can enter a summary for your data mining project that will be displayed in the project report. Contents. Lists the type and number of components referenced by the project fi le (not editable). Save unsaved object as. Speci fi es whether unsaved objects should be saved to ...
207 Projects and Reports Figure 11-6 Annotations tab in the project properties dialog box E Enter keywords and text to describe the project. Folder Properties and Annotations Individual project folders (in both CRISP-DM and Classes view) can be annotated. In CRISP-DMview, this can be an extremely ef...
209 Projects and Reports Closing a Project When you exit IBM® SPSS® Modeler or open a new project, the existing project fi le ( .cpj ) is closed. Some fi les associated with the project (such as streams, nodes or graphs) may still be open. If you want to leave these fi les open, reply No to the mess...
210 Chapter 11 Figure 11-9 Generated report window To generate a report: E Select the project folder in either CRISP-DM or Classes view. E Right-click the folder and click Project Report . E Specify the report options and click Generate Report .
212 Chapter 11 File type. Available fi le types are: HTML document. The report is saved as a single HTML fi le. If your report contains graphs, they are saved as PNG fi les and are referenced by the HTML fi le. When publishing your report on the Internet, make sure to upload both the HTML fi le and ...
Chapter 1 1 1 12 2 2 2 Customizing IBM SPSS Modeler Customizing IBM SPSS Modeler Options There are a number of operations you can perform to customize IBM® SPSS® Modeler to yourneeds. Primarily, this customization consists of setting speci fi c user options such as memory allocation, default directo...
216 Chapter 12 Figure 12-1 System Options dialog box Maximum memory. Select to impose a limit in megabytes on SPSS Modeler’s memory usage. On some platforms, SPSS Modeler limits its process size to reduce the toll on computers with limitedresources or heavy loads. If you are dealing with large amoun...
217 Customizing IBM SPSS Modeler is the path used for all client-side operations and output fi les (if they are referenced with relative paths). Set Server Directory. The Set Server Directory option on the File menu is enabled whenever there is a remote server connection. Use this option to specify ...
218 Chapter 12 Figure 12-2 User Options dialog box, Notifications tab Show stream execution feedback dialog . Select to display a dialog box that includes a progress indicator when a stream has been running for three seconds. The dialog box also includes detailsof the output objects created by the s...
219 Customizing IBM SPSS Modeler Visual Notifications The options in this group are used to specify the behavior of the Outputs and Models tabs in themanagers pane at the top right of the display when new items are generated. Select New Model or New Output from the list to specify the behavior of th...
220 Chapter 12 Select Always to always open a new output window. Select If generated by current stream to open a new window for output generated by the stream currently visible in the canvas. Select Never to restrict the software from automatically opening new windows for generated output. Click Def...
221 Customizing IBM SPSS Modeler Standard Fonts & Colors (effective on restart). Options in this control box are used to specify the SPSS Modeler screen design, color scheme, and the size of the fonts displayed. Options selectedhere do not take effect until you close and restart SPSS Modeler. Lo...
222 Chapter 12 Figure 12-4 User Options dialog box, PMML tab Export PMML. Here you can con fi gure variations of PMML that work best with your target application. Select With extensions to allow PMML extensions for special cases where there is no standard PMML equivalent. Note that in most cases thi...
223 Customizing IBM SPSS Modeler Customizing the Nodes Palette Streams are built using nodes. The Nodes Palette at the bottom of the IBM® SPSS® Modelerwindow contains all of the nodes it is possible to use in stream building. For more information,see the topic Nodes Palette in Chapter 3 on p. 18. Yo...
224 Chapter 12 Figure 12-6 Palette Manager showing the tabs displayed on the Nodes Palette Palette Name. Each available palette tab, whether shown on the Nodes Palette or not, is listed. This includes any palette tabs that you have created. For more information, see the topic Creating a Palette Tab ...
225 Customizing IBM SPSS Modeler Creating a Palette Tab Figure 12-7 Palette tab creation on the Create/Edit Palette dialog box To create a custom palette tab: E From the Tools menu, open the Palette Manager. E To the right of the Shown? column, click the Add Palette button; the Create/Edit Palette d...
226 Chapter 12 Figure 12-8 Palette Manager showing the tabs displayed on the Nodes Palette To select which tabs are to be shown on the Nodes Palette: E From the Tools menu, open the Palette Manager. E Using the check boxes in the Shown? column, select whether to include or hide each palette tab. To ...
227 Customizing IBM SPSS Modeler Figure 12-9 Subpalettes available for the Modeling Palette tab To select subpalettes for display on a palette tab: E From the Tools menu, open the Palette Manager. E Select the palette that you require. E Click the Sub Palettes button; the Sub Palettes dialog box is ...
229 Customizing IBM SPSS Modeler To change the nodes shown on a palette tab, select the palette tab and then, from the menu on theleft, select to display either all nodes, or just those in a speci fi c subpalette. Figure 12-11 Modeling palette tab showing the Classification subpalette CEMI Node Mana...
Chapter 1 1 1 13 3 3 3 Performance Considerations forStreams and Nodes You can design your streams to maximize performance by arranging the nodes in the mostef fi cient con fi guration, by enabling node caches when appropriate, and by paying attention to other considerations as detailed in this sect...
231 Performance Considerations for Streams and Nodes The following operations cannot be performed in most databases. They should be placed in thestream after the operations in the preceding list: Operations on any nondatabase data, such as fl at fi les Merge by order Balance Distinct operations in d...
233 Performance Considerations for Streams and Nodes Note : The following databases support temporary tables for the purpose of caching: DB2, Netezza, Oracle, SQL Server, and Teradata. Other databases will use a normal table for databasecaching. The SQL code can be customized for speci fi c database...
234 Chapter 13 Aggregate. When the Keys are contiguous option is not set, this node reads (but does not store) its entire input data set before it produces any aggregated output. In the more extreme situations,where the size of the aggregated data reaches a limit (determined by the SPSS Modeler Serv...
235 Performance Considerations for Streams and Nodes offset value is not a literal integer; for example, @OFFSET(Sales, Month) . The offset value is the fi eld name Month , whose value is unknown until executed. The server must save all values of the Sales fi eld to ensure accurate results. Where an...
Appendix A A A A Accessibility in IBM SPSS Modeler Overview of Accessibility in IBM SPSS Modeler This release offers greatly enhanced accessibility for all users, as well as speci fi c support for users with visual and other functional impairments. This section describes the features and methods ofw...
237 Accessibility in IBM SPSS Modeler Controlling the Automatic Launching of New Windows The Noti fi cations tab on the User Options dialog box is also used to control whether newly generated output, such as tables and charts, are launched in a separate window. It may be easierfor you to disable thi...
238 Appendix A Controlling the Automatic Launching of New Windows The Noti fi cations tab on the User Options dialog box is also used to control whether newly generated output is launched in a separate window. It may be easier for you to disable this optionand open an output window as needed. E To s...
239 Accessibility in IBM SPSS Modeler Shortcut Key Function Ctrl+F6 Moves focus to the stream canvas. Ctrl+F7 Moves focus to the managers pane. Ctrl+F8 Moves focus to the project pane. Node and Stream Shortcuts Shortcut Key Function Ctrl+N Creates a new blank stream canvas. Ctrl+O Displays the Open ...
240 Appendix A Shortcut Key Function Ctrl+Alt+L When a model nugget is selected in the stream, opens an Insertdialog box to enable you to load a saved model from a .nod fi le into the stream. Ctrl+Alt+R Displays the Annotations tab for a selected node, enabling you torename the node. Ctrl+Alt+U Crea...
242 Appendix A Shortcuts for Comments When working with on-screen comments, you can use the following shortcuts. Shortcut Key Function Alt+C Toggles the show/hide comment feature. Alt+M Inserts a new comment if comments are currently displayed;shows comments if they are currently hidden. Tab On the ...
243 Accessibility in IBM SPSS Modeler Cluster Viewer only The Cluster Viewer has a Clusters view that contains a cluster-by-features grid. To choose the Clusters view instead of the Model Summary view: E Press Tab repeatedly until the View button is selected. E Press Down Arrow twice to select Clust...
244 Appendix A E Spacebar. Selects the Variable File node. E Ctrl+Enter. Adds the Variable File node to the stream canvas. This key combination also keeps selection on the Variable File node so that the next node added will be connected to it. E Tab. Moves focus back to the node palette. E Right Arr...
245 Accessibility in IBM SPSS Modeler Use F3 to destroy all connections for a selected node in the canvas. Once you have created a stream, use Ctrl+E to run the current stream. A complete list of shortcut keys is available. For more information, see the topic Shortcuts for Navigating the Main Window...
246 Appendix A Accessibility in the Interactive Tree Window The standard display of a decision tree model in the Interactive Tree window may cause problemsfor screen readers. To access an accessible version, on the Interactive Tree menus click: View > Accessible Window This displays a view simila...
247 Accessibility in IBM SPSS Modeler Typing the first letter to find element in tree list. When looking for an element in the categories pane, extracted results pane, or library tree, you can type the fi rst letter of the element when the pane has the focus. This will select the next occurrence of ...
Appendix B B B B Unicode Support Unicode Support in IBM SPSS Modeler IBM® SPSS® Modeler is fully Unicode-enabled for both IBM® SPSS® Modeler and IBM®SPSS® Modeler Server. This makes it possible to exchange data with other applications thatsupport Unicode, including multi-language databases, without ...
Appendix C C C C Notices This information was developed for products and services offered worldwide. IBM may not offer the products, services, or features discussed in this document in other countries.Consult your local IBM representative for information on the products and services currentlyavailab...
250 Appendix C Such information may be available, subject to appropriate terms and conditions, including insome cases, payment of a fee. The licensed program described in this document and all licensed material available for it areprovided by IBM under terms of the IBM Customer Agreement, IBM Intern...
Index 508 compliance, 236 abs function, 138accessibility, 236, 247 example, 243–244features in IBM SPSS Modeler, 236tips in IBM SPSS Modeler, 246 adding to a project, 202 adding IBM SPSS Modeler Server connections, 14, 16Aggregate node performance, 234 allbut fi rst function, 141 allbutlast function...
IBM Manuals
-
IBM SC41-5210-04
Manual
-
IBM TM7
Manual
-
IBM 8677
Manual
-
IBM EM78P809N
Manual
-
IBM 610
Manual
-
IBM AIX 4.3
Manual
-
IBM 8128
Manual
-
IBM PANEL1173-841
Manual
-
IBM L150
Manual
-
IBM E50
Manual
-
IBM 8319
Manual
-
IBM DS3300
Manual
-
IBM 9519-AG1
Manual
-
IBM 6.00E+04
Manual
-
IBM PD78082(A)
Manual
-
IBM S544-5343-03
Manual
-
IBM PPC-1510PT
Manual
-
IBM 37L1388
Manual
-
IBM V2.3
Manual
-
IBM DS4500
Manual