This chapter describes techniques for structuring the data that is returned by BI Publisher's data engine, including grouping, linking, group filters, and group-level and global-level functions.
This chapter includes the following sections:
The Data Model diagram helps you to quickly and easily define data sets, break groups, and totals for a report based on multiple data sets.
If you do not link the data sets (or queries) the data engine produces a multipart unrelated query data set.
For example, in the data model shown in Figure 15-1, one query selects products and another selects customers. Notice that there is no relationship between the products and customers.
This results in the data structure shown in Figure 15-2.
In many cases, the data fetched for one part of the data set (or query) is determined by the data fetched for another part. This is often called a "master/detail," or "parent/child," relationship, and is defined with a data link between two data sets (or queries). When you run a master/detail data model, each row of the master (or parent) query causes the detail (or child) query to be executed, retrieving only matching rows.
In the example shown in Figure 15-3, two data sets are linked by the element Customer ID. The Orders data set a child of the Customers data set.
This produces the data structure shown in Figure 15-4.
Following are recommended guidelines for building data models:
Reduce the number of data sets or queries in your data model as much as possible. In general, the fewer data sets and queries you have, the faster your data model will run. While multiquery data models are often easier to understand, single-query data models tend to execute more quickly. It is important to understand that in parent-child queries, for every parent, the child query is executed.
You should only use multiquery data models in the following scenarios:
To perform functions that the query type, such as a SQL query, does not support directly.
To support complex views (for example, distributed queries or GROUP BY queries).
To simulate a view when you do not have or want to use a view.
The data model editor enables you to combine data from multiple data sets into a single XML data structure. Data sets from multiple data sources can be merged either as sequential XML or at line-level to create a single combined hierarchical XML. Using the data model editor you can easily combine data from the following data set types: SQL query, OLAP (MDX query), LDAP, and Microsoft Excel.
The data model editor supports the following
Group data — Groups are created to organize the columns in your report. Groups can do two things: separate a query's data into sets, and filter a query's data.
When you create a query, the data engine creates a group that contains the columns selected by the query; you can create groups to modify the hierarchy of the data appearing in a data model. Groups are used primarily when you want to treat some columns differently than others. For example, you create groups to produce subtotals or create breaks.
Link data — Define master-detail links between data sets to group data at multiple levels.
Aggregate data — Create group level totals and subtotals.
Transform data — Modify source data to conform to business terms and reporting requirements.
Create calculations — Compute data values that are required for your report that are not available in the underlying data sources.
The data model editor provides functions at the element level, the group level, and the global level. Note that not all data set types support all functions. See the Important Notes section that accompanies your data set type for limitations. Figure 15-5 highlights some of the features and actions available in the data model editor.
By default, the data sets that you created are shown in the Diagram View as separate objects, as seen in Figure 15-6.
The data set structure builder has three views:
Diagram View — (Shown in Figure 15-6) This view displays your data sets and enables you to graphically create links and filters, add elements based on expressions, add aggregate functions and global-level functions, edit element properties, and delete elements. The Diagram View is typically the view you use to build your data structure.
Structure View — This view has two modes:
Table View and Output
The table view displays element properties in a table and enables you to update XML element alias names, presentation names of the elements, sorting, null values, and reset options. Figure 15-7 shows the structure Table View.
The Output view provides a clear view of the XML structure that is generated. The Output view cannot be updated. Figure 15-8 shows the Output view.
Code View — This view displays the data structure code created by the data structure builder that is read by the data engine. The code view cannot be updated. Figure 15-9 shows the code view.
Joining and structuring data at the source into one combined data set is sometimes not possible. For example, you cannot join data at the source when data resides in disparate sources such as Microsoft SQL Server and an Oracle database. You can use the BI Publisher data engine to combine and structure data after you extract it from the data source. Even if your data is coming from the same source, if you are creating large reports or documents with potentially hundreds of thousands of rows or pages, structuring your data so that it matches the intended layout optimizes document generation.
Create a link to define a master-detail (or parent-child) relationship between two data sets. You can create links as element-level links or group-level links. The resulting, hierarchical XML data is the same. Creating links as element-level links is the preferred method. Group-level links are provided for backward compatibility with data templates from earlier versions of BI Publisher.
A data link (or parent-child relationship) relates the results of multiple queries. A data link can establish these relationships:
Between one query's column and another query's column
Between one query's group and another query's group (this is useful when you want the child query to know about its parent's data)
Element-level links create a bind (join) between two data sets and define a master-detail (parent-child) relationship between them. This is the preferred method of defining master detail relationships between data sets. The simplest way to link data sets is by creating element-level links because they do not require you to code a join between the two data sets through a bind variable.
Group-level links also determine the way data sets are structured as hierarchical XML, but lack the join information that the data engine needs to execute the master and detail queries. When you define a group-level link, you must also update your query with a link between the two data sets through a unique bind variable.
Link data sets to define a master-detail (or parent-child) relationship between two data sets. Defining an element-level link enables you to establish the binding between the elements of the master and detail data sets.
To define an element-level link, do one of the following:
Open the element action menu and click Create Link.
The Create Link dialog launches and displays the elements from the other data sets. Choose the element and click OK to create the link. The Create Link dialog is shown in Figure 15-11.
Alternatively, from the parent group, click and drag the element you want to bind to the matching element in the child group, as shown in Figure 15-12.
After dropping the element from the parent data set to the matching element on the child data set, a connector displays between the data sets. Pause your cursor over the connector to display the link (as shown in Figure 15-13).
A group-level link defines a master-detail relationship between two data sets. The following figure shows two data sets with a group-level link defined. Next to the data sets the resulting XML data structure is shown, as in Figure 15-14.
To define a group-level link:
In the parent group, click Menu (in the upper right corner of the object).
Click Create Group Link as shown in Figure 15-15.
In the Create Group Link dialog, select the child group and click OK. The Create Link dialog is shown in Figure 15-16.
Click Menu and then click Edit Data Set to add the bind variables to your query.
An example is shown in Table 15-1.
Table 15-1 Example: Edit Data Set
| Data Set: DEPT | Data Set: EMP | 
|---|---|
| 
Select DEPT.DEPTNO as DEPTID,
        DEPT.DNAME as DNAME,
        DEPT.LOC as LOC 
from    OE.DEPT DEPT
 | 
Select EMP.EMPNO as EMPNO,
        EMP.ENAME as ENAME,
        EMP.JOB as JOB,
        EMP.MGR as MGR,
        EMP.HIREDATE as HIREDATE,
        EMP.SAL as SAL,
        EMP.COMM as COMM,
        EMP.DEPTNO as DEPTNO 
from    OE.EMP EMP 
where   DEPTNO=:DEPTID
 | 
| Important: A unique bind variable must be defined in the child query. | 
In addition to creating parent-child structures by linking two data sets, you can also group elements in the same data set by other elements. This might be helpful if your query returns data that has header data repeated for each detail row. By creating a subgroup you can shape the XML data for better, more efficient document generation.
To create a subgroup:
Select the element by which you want to group the other elements in the data set.
Click the element action menu icon to open the menu and select Group by as shown in Figure 15-17.
This creates a new group within the displayed data set. The following figure shows the G_3 data set grouped by the element COMPANY. This creates a new group called G_4, that contains the other four elements in the data set. Figure 15-18 shows how the grouped data set is displayed in the Diagram View along with the structure.
You can perform any of the group actions on the new group you have created.
On the group's title bar, click Menu and then click Ungroup.
Once you have created a group within your data set, two new options display on the element action menu that enable you to move elements between the parent and child groups.
For the element that you want to move, click the element action icon to open the menu. If the element is in the parent group and you want to move it to the child group, and then Move this element to Child Group.
If the element is in the child group and you want to move it to the parent group, select and then Move this element to Parent Group. In Figure 15-19, the element action menu for OFFICE_DSC displays the option to move the element to the parent group.
| Important: Before moving an element be aware of any dependencies on other elements. | 
You can use the data model editor to aggregate data at the group or report level. For example, if you group sales data by Customer Name, you can aggregate sales to get a subtotal for each customer's sales. Note that you can only aggregate data for at the parent level for a child element.
The aggregate functions are:
Average — Calculates the average of all the occurrences of an element.
Count — Counts the number of occurrences of an element.
First — Displays the value of the first occurrence of an element in the group.
Last — Displays the value of the last occurrence of an element in the group.
Maximum — Displays the highest value of all occurrences of an element in the group.
Minimum — Displays the lowest value of all occurrences of an element in a group.
Summary — Sums the value of all occurrences of an element in the group.
To create group-level aggregate elements:
Drag the element to the Drop here for aggregate function field in the parent group.
Figure 15-20 shows creating a group-level aggregate function in the G_DEPT based on the SALARY element.
Once you drop the element, a new element is created in the parent group. By default, the Count function is applied. The icon next to the name of the new aggregate element indicates the function. Pause your cursor over the icon to display the function.
Figure 15-21 shows the new aggregate element, CS_1. with the default Count function defined.
To change the function: Click the function icon to view a list of available functions and choose from the list, as shown in Figure 15-22.
To rename the element or update other properties, click the element's Action menu icon. On the menu, click Properties. The Properties dialog is shown in Figure 15-23.
| Important: Be careful when renaming an element as it can have dependency on other elements. | 
Set the properties described in Table 15-2 as needed.
Table 15-2 Element Properties
| Property | Description | 
|---|---|
| Column Name | The internal name assigned to the element by the BI Publisher data model editor. This name cannot be updated. | 
| Alias (XML Tag Name) | BI Publisher assigns a default tag name for the element in the XML data file. You can update this tag name to assign a more user-friendly name within the data file. | 
| Display Name | The Display Name appears in the report design tools. Update this name to be meaningful to your business users. | 
| Function | If you have not already selected the desired function, then you can select it from the list here. | 
| Data Type | BI Publisher assigns a default data type of Integer or Double depending on the function. Some functions also provide the option of Float. | 
| Value if Null | If the value returned from the function is null, you can supply a default value here to prevent having a null in your data. | 
| Round | By default, the value is rounded to the nearest third decimal. You can change the round value, if needed. | 
| Do Not Reset | By default, the function resets at the group level. For example, if your data set is grouped by DEPARTMENT_ID, and you have defined a sum function for SALARY, then the sum is reset for each group of DEPARTMENT_ID data, giving you the sum of SALARY for that department only. If instead you want the function to reset only at the global level, and not at the group level, select Do Not Reset. This creates a running total of SALARY for all departments. Note that this property is for group level functions only. | 
Filters enable you to conditionally remove records selected by your queries. Groups can have two types of filters:
Expression — Create an expression using predefined functions and operators
PL/SQL Function — Create a custom filter
To create a group filter:
Click Menu, and then select Create Group Filter. The Edit Group Filter dialog is displayed as shown in Figure 15-24.
Select the Group Filter Type: Expression or PL/SQL.
| Note: For PL/SQL filters, you must first specify the PL/SQL Package as the Oracle DB Default Package in the data model properties. See Setting Data Model Properties. | 
Enter the Filter:
To enter an expression, select the elements and click the shuttle button to move the element to the Group Filter definition box. Click the predefined functions and operators to insert them in the Group Filter box.
Refer to Function Reference for a description of the available functions.
Click Validate Expression to ensure that the entry is valid.
To enter a PL/SQL function, select the PL/SQL package from the Available box and click the shuttle button to move the function to the Group Filter box.
Your PL/SQL function in the default package must return a Boolean type.
After you have added the group filter, the data set object displays the filter indicator, as shown in Figure 15-25.
To edit or delete a group filter:
Click Menu for the data set.
Choose the appropriate action:
To edit the group filter, choose Edit Group Filter to launch the Group Filter dialog for editing.
To delete the group filter, choose Delete Group Filter.
You can perform the following functions at the element level:
Group by an element to create a subgroup, as described in Creating Subgroups
Create element-level links between data sets, as described in Creating Element-Level Links
Set element properties, as described in Setting Element Properties
You can set properties for individual elements. Note that these properties are also editable from the Structure View. If you need to update multiple element properties, it may be more efficient to use the Structure View. See Using the Structure View to Edit Your Data Structure.
To set element-level properties using the element dialog:
Click the element's action menu icon. From the menu, select Properties. The Properties dialog is shown in Figure 15-26.
Set the properties as needed, as described in Table 15-3.
Table 15-3 Element Properties
| Property | Description | 
|---|---|
| Alias | BI Publisher assigns a default tag name to the element in the XML data file. You can update this tag name to assign a more user-friendly name within the data file. | 
| Display Name | The Display Name appears in the report design tools. Update this name to be meaningful to your business users. | 
| Data Type | BI Publisher assigns a default data type of Integer or Double depending on the function. Some functions also provide the option of Float. | 
| Sort Order | You can sort XML data in a group by one or more elements. For example, if in a data set employees are grouped by department and manager, you can sort the XML data by department. Within each department you can group and sort data by manager, and within each manager subgroup, employees can be sorted by salary. If the element is not in a parent group, the Sort Order property is not available. | 
| Value if Null | If the value returned from the function is null, you can supply a default value here to prevent having a null in your data. | 
Sorting is supported for parent group break columns only. For example, if a data set of employees is grouped by department and manager, you can sort the XML data by department. Within each department you can group and sort data by manager. If you know how the data should be sorted in the final report, you specify sorting at data generation time to optimize document generation.
To apply a sort order to a group:
Click the action menu icon of the element you want to sort by. From the menu, select Properties.
Select the Sort Order.
Figure 15-27 shows the Properties dialog for the DEPARTMENT_ID element with the Sort Order list displayed.
This section describes how to perform group-functions. It includes the following topics:
The Menu button is available at the group level and enables you to perform the following:
Create and delete group links, as described in Creating Group-Level Links
Create, edit, and delete group filters, as described in Creating Group Filters
Add an element to the group based on an expression, as described in Adding a Group-Level or Global-Level Element by Expression
Edit the data set, as described in Editing the Data Set
Remove elements from the group, as described in Removing Elements from the Group
Edit group properties, as described in Editing the Group Properties
The group-level Menu button is shown in Figure 15-28.
To edit the underlying data set:
Click Edit Data Set to launch the data set editor.
See the appropriate section for the data set type in Chapter 14, "Creating Data Sets" for more information.
To remove an element from the group:
On the element row, click the menu and then click Remove Element. An example is shown in Figure 15-29.
| Note: You can only remove elements added as a group function (sum, count, and so on) or added as an expression. | 
To edit the group properties:
Click Menu and select Properties.
Edit the Group Name and click OK, as shown in Figure 15-30.
The Global Level Functions object enables you to add elements to your report data set at the top report level. You can add the following types of elements as top-level data:
Elements based on aggregate functions
Elements based on expressions
Elements based on PL/SQL statements (for Oracle Database data sources)
| Important: If you select a data type of Integer for any calculated element and the expression returns a fraction, the data is not truncated. | 
The Global Level Functions object is shown in Figure 15-31. To add elements based on aggregate functions, drag the element to the "Drop here for aggregate function" space of the object. To add an element based on an expression or PL/SQL, click Menu, and choose the appropriate action.
To add a global aggregate function:
Drag and drop the data element from the data set to the "Drop here for aggregate function" area of the Global Level Functions object.
For example, Figure 15-32 shows creating a global level aggregate function based on the Salary element.
When you release the mouse, the data model editor assigns a default name to the aggregate element and assigns Count as the default function. Available functions are:
Count
Average
First
Last
Maximum
Minimum
Summary
To change the function, click the function icon to the left of the new element name and choose the function from the list.
Figure 15-33 shows the function for the new global level element CS_1 being modified from Count to Average.
To change the default name, click the actions icon to the right of the element name and click Properties to launch the Edit Properties dialog, See Setting Element Properties. for more about the properties available on this dialog.
To add a group-level or global-level element by expression:
To add a group-level element: On the Group object, click Menu and select Add Element by Expression.
To add a global level element: On the Global Level Functions object, click Menu and select Add Element by Expression, as shown in Figure 15-34.
In the Add Element by Expression dialog, enter the fields, as shown in Figure 15-35.
Table 15-4 Add Element by Expression Dialog Fields
| Field | Description | 
|---|---|
| Name | Enter a name for this element. | 
| Display Name | The Display Name appears in the report design tools. Enter a name that is meaningful to your business users. | 
| Data Type | Select from the list of data types: String, Integer, Double, Float, or Date. | 
Enter the expression.
Use the shuttle arrow to move the data elements required for the expression from the Available box to the Expression box.
Click an operator to insert it in the Expression box, or choose from the function list.
Refer to Function Reference for a description of the available functions.
Click Validate Expression to validate.
The PL/SQL function must return a VARCHAR data type.
To add a global-level element by PL/SQL:
On the Properties page, specify the PL/SQL Package as the Oracle DB Default Package in the data model properties. See Setting Data Model Properties.
On the Global Level Functions object, click Menu, and then click Add Element by PL/SQL.
In the Add Element by PL/SQL dialog, enter the fields, as shown in Figure 15-36 and as described in Table 15-5.
Select the PL/SQL package from the Available box and click the shuttle button to move the function to the Group Filter box.
The Structure view enables you to preview the structure of your data model. The Data Source column displays the date elements in a hierarchical tree that you can collapse and expand. Use this view to verify the accuracy of the data model structure. The Structure view is shown in Figure 15-37.
Use the Structure page to define user-friendly names for elements in the data model. You can rename both the XML element tag name (XML View) and the name that displays in the report layout tools (Business Name). Figure 15-38 shows renaming the DEPARTMENT_ID element to display as Department Number.
Table 15-6 describes the usage of supported functions available from the Add Element by Expression dialog and the Edit Group Filter dialog.
Table 15-6 Supported Functions from the Add Element by Expression Dialog
| Function | Description | Syntax | Example | 
|---|---|---|---|
| IF | Logical IF operator Evaluates boolean_expr, and returns true_return if boolean_expr is true, and false_return if boolean_expr is false. | IF (boolean_expr, true_return, false_return) | IF (G_1.DEPARTMENT_ID == 10, 'PASSED', 'FAIL')returns 'PASSED' if DEPARTMENT_ID = 10, otherwise returns 'FAIL' | 
| NOT | Logical NOT operator Evaluates boolean_expr, and returns true if boolean_expr is false. | STRING(NOT(boolean_expr)) | STRING(NOT(G_1.JOB_ID == 'MANAGER'))returns 'TRUE' if JOB_ID = MANAGER, otherwise returns 'FALSE' | 
| AND | Logical AND operator Evaluates boolean_expr1 and boolean_expr2, and returns true if both boolean expressions are true, otherwise returns false. | STRING(AND(boolean_expr1, boolean_expr2, ...)) | STRING(AND (G_1.JOB_ID == 'MANAGER', G_1.DEPARTMENT_ID == 10))returns 'TRUE' if both JOB_ID = MANAGER and DEPARTMENT_ID = 10, otherwise returns 'FALSE' | 
| && | Logical AND operator Evaluates boolean_expr1 and boolean_expr2, and returns true if both boolean expressions are true, otherwise returns false. | STRING(boolean_expr1 && boolean_expr2) | STRING(G_1.JOB_ID == 'MANAGER' && G_1.DEPARTMENT_ID == 10) returns 'TRUE' if both JOB_ID = MANAGER and DEPARTMENT_ID = 10, otherwise returns 'FALSE' | 
| || | Logical OR operator Evaluates boolean_expr1 and boolean_expr2 and returns true if both boolean expressions are true, otherwise returns false. | STRING(OR(boolean_expr1, boolean_expr2) | STRING(OR (G_1.JOB_ID == 'MANAGER', G_1.DEPARTMENT_ID == 10)) returns 'TRUE' if either JOB_ID = MANAGER or DEPARTMENT_ID = 10, otherwise returns 'FALSE' | 
| Returns the maximum value of the element in the set. | MAX(expr1, expr2, expr3, ...) | MAX(G1_Salary, 10000) returns max of salary or 10000 | |
| Returns the minimum value of the element in the set. | MIN(expr1, expr2, expr3, ...) | MIN(G1_Salary,5000) returns min of salary or 5000 | |
| Returns a number rounded to the integer places right of the decimal point. | ROUND(number[,integer]) If integer is omitted, number is rounded to 0 places. Integer can be negative to round off digits left of the decimal point. Integer must be an integer. | ROUND(2.777) returns 3 ROUND(2.777, 2) returns 2.78 | |
| Returns the smallest integer equal to or less than n. | FLOOR(n) | FLOOR(2.777) returns 2 | |
| Returns the largest integer greater than or equal to n. | CEILING(n) | CEILING(2.777) returns 3 | |
| Returns the absolute value of n. | ABS(n) | ABS(-3) returns 3 | |
| Returns the average value of the expression. | AVG(expr1, expr2, expr3, ...) | AVG(G_1.SALARY,G_1.COMMISSION_PCT*G_1.SALARY) returns the average of SALARY and COMMISSION For example, if SALARY = 14000 and COMMISSION_PCT = .4, the expression evaluates to 9800.0 | |
| Returns the length of an array. The LENGTH function calculates the length using characters as defined by the input character set. If char is null, the function returns null. If char is an array, it returns the length of the array. | LENGTH(expr) | Example to return the length of an array: LENGTH{1, 2, 4, 4}) returns 4 Example to return the length of a string: LENGTH('countries') returns 9 | |
| Returns the sum of the value of the expression. | SUM(expr1, expr2, ...) | SUM (G_1.SALARY, G_1.COMMISSION_PCT*G_1.SALARY) returns sum of salary and commission For example, if SALARY = 14000 and COMMISSION_PCT =.4, the expression evaluates to 19,600.0 | |
| Replaces null (returned as a blank) with a string in the results of a query. | NVL(expr1, expr2) If expr1 is null, then NVL returns expr2. If expr1 is not null, then NVL returns expr1. | NVL(G_1.COMMISSION_PCT, .3) returns .3 when G_1.COMMISSION_PCT is null | |
| Returns char1 concatenated with char2. | CONCAT(char1, char2) | CONCAT(CONCAT(First_Name, ' '), Last_Name) where First_Name = Joe and Last_Name = Smith returns Joe Smith | |
| Returns char as a string data type. | STRING(expr) | STRING(G1_SALARY) where salary = 4400 returns 4400 as a string | |
| Extracts a substring from a string. | SUBSTRING(string, start_pos, end_pos) string is the source string. start_pos is the position to start the extraction. end_pos is the end position of the string to extract (optional). | SUBSTRING('this is a test', 5, 7) returns "is" (that is, characters 6 through 7)SUBSTRING('this is a test', 5) returns "is a test" | |
| Returns the position/location of the first character of a substring in a string. | INSTR(string1, string2) string1 is the string to search. string2 is the substring to search for in string1. | INSTR('this is a test', 'is a') returns 5 | |
| Converts a valid Java date string to a date data type in canonical format. | DATE(char, format_string) where (1) char is any valid Java date string (for example, 13-JAN-2013)(2) format_string is the Java date format of the input string (for example, dd-MMM-yyyy) The input and format strings must be a valid Java date format string. | DATE(01-Jan-2013,'dd-MMM-yyyy') returns 2013-01-01T08:00:00.000+00:00 | |
| Converts a date argument in the Java date format to a formatted string. | FORMAT_DATE(date,format_string) | FORMAT_DATE(SYSDATE,'dd-MMM-yyyy') where the value of SYSDATE = 2013-01-24T16:32:45.000-08:00returns 24-Jan-2013 | |
| Converts a number or numeric string to a string in the specified number format. | FORMAT_NUMBER(number,format_string) | FORMAT_NUMBER(SOME_NUMBER, '$9,999.00') where the value of SOME_NUMBER = 12345.678returns $12,345.68 | |
| DECODE | Replaces the value of an expression with another value based on the specified search and replace criteria. | DECODE(expr, search, result [, search, result]...[, default]) | DECODE(PROD_FAMILY_CODE,100,'Colas',200,'Root Beer',300,'Cream Sodas',400,'Fruit Sodas','Other')returns(1) 'Colas' if PROD_FAMILY_CODE = 100(2) 'Root Beer' if PROD_FAMILY_CODE = 200(3) 'Cream Sodas' if PROD_FAMILY_CODE = 300(4) 'Fruit Sodas' if PROD_FAMILY_CODE = 400(5) 'Other' if PROD_FAMILY_CODE is any other value | 
| REPLACE | Replaces a sequence of characters in a string with another set of characters. | REPLACE(expr,string1,string2) where string1 is the string to search for and string2 is the string to replace. | REPLACE(G_1.FIRST_NAME,'B','L') where G_1.FIRST_NAME = Barry returns Larry |