SlideShare a Scribd company logo
Timothy C. Fanelli - Senior IT Specialist
23 September 2013

Three Key Concepts for
Understanding JSR-352:
Batch Applications for the
Java Platform

© 2013 IBM Corporation

Sunday, September 22, 13
Important Disclaimers
THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY.
WHILST EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION
CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED.
ALL PERFORMANCE DATA INCLUDED IN THIS PRESENTATION HAVE BEEN GATHERED IN A CONTROLLED
ENVIRONMENT. YOUR OWN TEST RESULTS MAY VARY BASED ON HARDWARE, SOFTWARE OR INFRASTRUCTURE
DIFFERENCES.
ALL DATA INCLUDED IN THIS PRESENTATION ARE MEANT TO BE USED ONLY AS A GUIDE.
IN ADDITION, THE INFORMATION CONTAINED IN THIS PRESENTATION IS BASED ON IBM’S CURRENT PRODUCT
PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM, WITHOUT NOTICE.
IBM AND ITS AFFILIATED COMPANIES SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE
OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION.
NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, OR SHALL HAVE THE EFFECT OF:
- CREATING ANY WARRANT OR REPRESENTATION FROM IBM, ITS AFFILIATED COMPANIES OR ITS OR THEIR
SUPPLIERS AND/OR LICENSORS
2

Sunday, September 22, 13

© 2013 IBM Corporation
About me
 Timothy C. Fanelli
 Senior IT Specialist, IBM - Mainframe
Workload Modernization
 Instructor of Software Engineering Clarkson University, Potsdam NY
 tim@timfanelli.com
 tfanelli@us.ibm.com, tfanelli@clarkson.edu
 Visit the IBM booth #5112 and meet other
IBM developers at JavaOne 2013
3

Sunday, September 22, 13

© 2013 IBM Corporation
Agenda
 Background
 Three Key Concepts
– Implementation
– Orchestration
– Execution
 An Example JSR 352 Application
 Advanced Topics
– Splits and Flows
– Partitioning
– Java EE
 Conclusion and Thoughts on What’s Next...

4

Sunday, September 22, 13

© 2013 IBM Corporation
Background

© 2013 IBM Corporation

Sunday, September 22, 13
Batch Processing
 One of the oldest processing paradigms
 Historically associated with mainframe computing
 Still incredibly relevant today, with fresh challenges in an OLTP driven world

6

Sunday, September 22, 13

© 2013 IBM Corporation
Batch Processing
 One of the oldest processing paradigms
 Historically associated with mainframe computing
 Still incredibly relevant today, with fresh challenges in an OLTP driven world

6

Sunday, September 22, 13

© 2013 IBM Corporation
Java for Batch Processing?
 Mainframe developers have shied away from Java
– Performance concerns over native languages
– Integration concerns for legacy data
– Disparate developer skill set between System Z and Java

7

Sunday, September 22, 13

© 2013 IBM Corporation
Java for Batch Processing?
 Mainframe developers have shied away from Java
– Performance concerns over native languages
– Integration concerns for legacy data
– Disparate developer skill set between System Z and Java
 Java and JavaEE have dominated the Online Transaction Processing world

7

Sunday, September 22, 13

© 2013 IBM Corporation
Java for Batch Processing?
 Mainframe developers have shied away from Java
– Performance concerns over native languages
– Integration concerns for legacy data
– Disparate developer skill set between System Z and Java
 Java and JavaEE have dominated the Online Transaction Processing world
 Time to bridge the two worlds together
– IBM Java for zOS, IBM WebSphere, and Spring Batch paved new paths
– Just-in-Time Compilation, Garbage Collection optimizations proved it out
– Adoption is wide-spread!

7

Sunday, September 22, 13

© 2013 IBM Corporation
Java for Batch Processing?
 Mainframe developers have shied away from Java
– Performance concerns over native languages
– Integration concerns for legacy data
– Disparate developer skill set between System Z and Java
 Java and JavaEE have dominated the Online Transaction Processing world
 Time to bridge the two worlds together
– IBM Java for zOS, IBM WebSphere, and Spring Batch paved new paths
– Just-in-Time Compilation, Garbage Collection optimizations proved it out
– Adoption is wide-spread!
 Only remaining challenge was the lack of a standard
– The need for JSR-352 was obvious

7

Sunday, September 22, 13

© 2013 IBM Corporation
JSR 352: Batch Applications for the Java Platform
 Expert working group formed 29 November 2011
– IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants
– Broad range of talent with deep batch experience

8

Sunday, September 22, 13

© 2013 IBM Corporation
JSR 352: Batch Applications for the Java Platform
 Expert working group formed 29 November 2011
– IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants
– Broad range of talent with deep batch experience
 Final Release 24 May 2013

8

Sunday, September 22, 13

© 2013 IBM Corporation
JSR 352: Batch Applications for the Java Platform
 Expert working group formed 29 November 2011
– IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants
– Broad range of talent with deep batch experience
 Final Release 24 May 2013
 Included in Java EE 7!

8

Sunday, September 22, 13

© 2013 IBM Corporation
Three Key Concepts...

© 2013 IBM Corporation

Sunday, September 22, 13
Three Key Concepts ...
 JSR 352 defines
– Implementation: A programming model for
implementing the artifacts
– Orchestration: A Job Specification Language,
which orchestrates the execution of a batch
artifacts within a job.
– Execution: A runtime environment for executing
batch application, according to a defined
lifecycle.

Orchestrate

Implement

Execute

10

Sunday, September 22, 13

© 2013 IBM Corporation
Three Key Concepts ...
 JSR 352 defines
– Implementation: A programming model for
implementing the artifacts
– Orchestration: A Job Specification Language,
which orchestrates the execution of a batch
artifacts within a job.
– Execution: A runtime environment for executing
batch application, according to a defined
lifecycle.

Orchestrate

Implement

Execute

 Note: “key” concepts, not “new” concepts!
– Roles and abstractions should be familiar to
SOA and JavaEE developers
10

Sunday, September 22, 13

© 2013 IBM Corporation
Anatomy of JSR352
 Those concepts define the anatomy of JSR 352: Batch
Applications for the Java Platform...

11

Sunday, September 22, 13

© 2013 IBM Corporation
Anatomy of JSR352
 Those concepts define the anatomy of JSR 352: Batch
Applications for the Java Platform...

Orchestrate

Implement

Execute

11

Sunday, September 22, 13

© 2013 IBM Corporation
Anatomy of JSR352
 Those concepts define the anatomy of JSR 352: Batch
Applications for the Java Platform...

Listeners
Contexts

Listeners

Partitioning

Batchlet

Orchestrate

Implement

Job
Operator

Job

Reader

Step

Processor

Chunk

Writer

Execute

Job Repository
Chunk
Chunk
Chunk
Listeners

11

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
 Chunk and Batchlet provide models for implementing
a step.

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

12

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
 Chunk and Batchlet provide models for implementing
a step.
 Contexts provide Job- and Step- level runtime
information, and provide interim data persistence.

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

12

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
 Chunk and Batchlet provide models for implementing
a step.
 Contexts provide Job- and Step- level runtime
information, and provide interim data persistence.
 Listeners provide callback hooks to respond to
lifecycle events on batch artifacts.

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

12

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
 Chunk and Batchlet provide models for implementing
a step.
 Contexts provide Job- and Step- level runtime
information, and provide interim data persistence.
 Listeners provide callback hooks to respond to
lifecycle events on batch artifacts.

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

 Partitioning provides a mechanism imposing parallel
processing on jobs and steps

Writer

Chunk
Chunk
Chunk
Listeners

12

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
Chunk vs Batchlet
Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

13

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
Chunk vs Batchlet
 Both are implementations of a step within a batch job
Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

13

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
Chunk vs Batchlet
 Both are implementations of a step within a batch job
 The chunk model
– Encapsulates a very common pattern: ETL
– Single “reader”, “processor” and “writer”
– Reader/Processor combination is invoked until
an entire “chunk” of data is processed
– Output “chunk” is written atomically

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

13

Sunday, September 22, 13

© 2013 IBM Corporation
Implementation: The programming model
Chunk vs Batchlet
 Both are implementations of a step within a batch job
 The chunk model
– Encapsulates a very common pattern: ETL
– Single “reader”, “processor” and “writer”
– Reader/Processor combination is invoked until
an entire “chunk” of data is processed
– Output “chunk” is written atomically
 Batchlet provides a “roll your own” step type
– Invoked and runs to completion, producing a
return code upon exit.

13

Sunday, September 22, 13

Listeners
Contexts

Listeners

Partitioning

Batchlet
Reader
Processor

Chunk

Writer

Chunk
Chunk
Chunk
Listeners

© 2013 IBM Corporation
Orchestration: The Job Specification Language (JSL)
 The JSL defines a batch job as an XML document
Job

14

Sunday, September 22, 13

Step

© 2013 IBM Corporation
Orchestration: The Job Specification Language (JSL)
 The JSL defines a batch job as an XML document
 Describes a step as an assemblage of batch artifacts

14

Sunday, September 22, 13

Job

Step

© 2013 IBM Corporation
Orchestration: The Job Specification Language (JSL)
 The JSL defines a batch job as an XML document
 Describes a step as an assemblage of batch artifacts

Job

Step

 Provides for the description of steps, step groupings,
and execution sequencing

14

Sunday, September 22, 13

© 2013 IBM Corporation
Execution: The JobOperator and Repository
 JobOperator is the runtime interface for job
management, including start, stop, restart and job
repository related commands

Job
Operator

Job Repository

15

Sunday, September 22, 13

© 2013 IBM Corporation
Execution: The JobOperator and Repository
 JobOperator is the runtime interface for job
management, including start, stop, restart and job
repository related commands
 The Job Repository holds information about
completed and executing jobs

15

Sunday, September 22, 13

Job
Operator

Job Repository

© 2013 IBM Corporation
Execution: The JobOperator and Repository
 JobOperator is the runtime interface for job
management, including start, stop, restart and job
repository related commands
 The Job Repository holds information about
completed and executing jobs

Job
Operator

Job Repository

 To start a batch job, get a JobOperator instance use
it to start a job described (described by JSL).

15

Sunday, September 22, 13

© 2013 IBM Corporation
Execution: JobInstance, JobExecution, and StepExecution

Job

Step

*
JobInstance

*
JobExecution

*

*

StepExecution
Job
Operator

16

Sunday, September 22, 13

Job Repository

© 2013 IBM Corporation
Execution: JobInstance, JobExecution, and StepExecution
 The state of a job is broken down into various parts,
and persisted in the repository
– Submitting a job creates a JobInstance, a
logical representation of a particular “run” of a
job.
– A JobExecution is a single attempt to run a
JobInstance. A restart attempt creates another
JobExecution
– Similarly, a StepExecution is a single attempt to
run a step within a job. It is created when a step
starts execution.
Job
Operator

16

Sunday, September 22, 13

Job

Step

*
JobInstance

*
JobExecution

*

*

StepExecution

Job Repository

© 2013 IBM Corporation
An Example JSR 352
Application

© 2013 IBM Corporation

Sunday, September 22, 13
The Application

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file
‣ Key capabilities

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file
‣ Key capabilities
– If something goes wrong, we don’t want to discard all the
prior work; and we want to pick up where we left off

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file
‣ Key capabilities
– If something goes wrong, we don’t want to discard all the
prior work; and we want to pick up where we left off
– We want control over the transaction scoping so prevent
lock contention in high volume periods

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file
‣ Key capabilities
– If something goes wrong, we don’t want to discard all the
prior work; and we want to pick up where we left off
– We want control over the transaction scoping so prevent
lock contention in high volume periods
– We want flexibility to “plug and play” where our records
come from

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Application
‣ A typical “batch hello world”:
– Reads strings from an input file
– Performs some validation or transforms
– Writes validated or transformed string to an output file
‣ Key capabilities
– If something goes wrong, we don’t want to discard all the
prior work; and we want to pick up where we left off
– We want control over the transaction scoping so prevent
lock contention in high volume periods
– We want flexibility to “plug and play” where our records
come from
– For unit testing, development testing, and QA testing records may come from a variety of sources

18

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model
– Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model
– Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces
– Loosely coupled artifacts will be orchestrated into a single-step job later

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model
– Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces
– Loosely coupled artifacts will be orchestrated into a single-step job later
– “Free” checkpoint/restart capability

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model
– Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces
– Loosely coupled artifacts will be orchestrated into a single-step job later
– “Free” checkpoint/restart capability
– Transaction scoping imposed externally in the job descriptor

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Design
‣ Let’s implement a string-transform in an extract-transform-load pattern
‣ We’ll use JSR352’s Chunk programming model
– Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces
– Loosely coupled artifacts will be orchestrated into a single-step job later
– “Free” checkpoint/restart capability
– Transaction scoping imposed externally in the job descriptor
‣ Job will be executed as a Java SE command line batch application

19

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

20

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

‣ An ItemReader encapsulates the data
access and deserialization of a record.

20

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

‣ An ItemReader encapsulates the data
access and deserialization of a record.
‣ No restriction on data access paradigm: use
DAO patterns, JDBC, JPA, Hibernate,
Spring Data, etc!

20

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

‣ An ItemReader encapsulates the data
access and deserialization of a record.
‣ No restriction on data access paradigm: use
DAO patterns, JDBC, JPA, Hibernate,
Spring Data, etc!
‣ Checkpoint/Restart data provided as
Serializable argument to “open” and from
“checkpointInfo” methods.
20

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

21

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemWriter is the output counterpart to
ItemReader

21

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemWriter is the output counterpart to
ItemReader
 Primary difference is that writeItems
accepts a “chunk” of output objects (as a
list) to serialize.

21

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemWriter is the output counterpart to
ItemReader
 Primary difference is that writeItems
accepts a “chunk” of output objects (as a
list) to serialize.
 Again, no restriction on data access
paradigm!

21

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

22

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemProcessor encapsulates the
business logic applied to each record

22

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemProcessor encapsulates the
business logic applied to each record
 “main” here demonstrates the invocation of
a batch job, using the JobOperator

22

Sunday, September 22, 13

© 2013 IBM Corporation
The Code
Implement

 An ItemProcessor encapsulates the
business logic applied to each record
 “main” here demonstrates the invocation of
a batch job, using the JobOperator
 Would typically not be in the processor
implementation

22

Sunday, September 22, 13

© 2013 IBM Corporation
The Batch Descriptor and Job Specification
Orchestrate

23

Sunday, September 22, 13

© 2013 IBM Corporation
The Batch Descriptor and Job Specification
Orchestrate

 batch.xml defines and names
the batch artifacts in this
application archive

23

Sunday, September 22, 13

© 2013 IBM Corporation
The Batch Descriptor and Job Specification
Orchestrate

 batch.xml defines and names
the batch artifacts in this
application archive
 sample.xml is an example Job
Specification Language
document for SampleBatchApp

23

Sunday, September 22, 13

© 2013 IBM Corporation
The Execution
Execute

24

Sunday, September 22, 13

© 2013 IBM Corporation
The Execution
Execute

 Package the application as a standard JAR or WAR
for deployment in JavaSE or EE environments
– batch.xml goes in META-INF or WEB-INF/
classes/META-INF
– JSL may go in META-INF/batch-jobs, or
submitted from an external source (up to the
provider!)

24

Sunday, September 22, 13

© 2013 IBM Corporation
The Execution
Execute

25

Sunday, September 22, 13

© 2013 IBM Corporation
The Execution
 See it live?

25

Sunday, September 22, 13

Execute

© 2013 IBM Corporation
Advanced Topics

© 2013 IBM Corporation

Sunday, September 22, 13
Job Management - Restart, Stop, Abandon
Execute

27

Sunday, September 22, 13

© 2013 IBM Corporation
Job Management - Restart, Stop, Abandon
 Had something gone wrong, what then?
– The “main” program shown was too simple... only
“started” the job

27

Sunday, September 22, 13

Execute

© 2013 IBM Corporation
Job Management - Restart, Stop, Abandon
 Had something gone wrong, what then?
– The “main” program shown was too simple... only
“started” the job

Execute

 JobOperator exposes APIs for a variety of job
management tasks: start, stop, abandon, restart
– Would have had to take advantage of these for
advanced job management capabilities.

27

Sunday, September 22, 13

© 2013 IBM Corporation
Job Management - Restart, Stop, Abandon
 Had something gone wrong, what then?
– The “main” program shown was too simple... only
“started” the job

Execute

 JobOperator exposes APIs for a variety of job
management tasks: start, stop, abandon, restart
– Would have had to take advantage of these for
advanced job management capabilities.
 The door is left open for more advanced batch job
management systems to be built!
– Integration into existing enterprise schedulers?
– New Java EE batch scheduling standard?
– Plenty of options, but currently left to the provider
to implement
27

Sunday, September 22, 13

© 2013 IBM Corporation
Java EE Integration
Execute

28

Sunday, September 22, 13

© 2013 IBM Corporation
Java EE Integration
 JSR-352: Java Batch is included in Java EE 7

28

Sunday, September 22, 13

Execute

© 2013 IBM Corporation
Java EE Integration
 JSR-352: Java Batch is included in Java EE 7

Execute

 Provides EE clustering, security, resource
management, etc to Java Batch applications

28

Sunday, September 22, 13

© 2013 IBM Corporation
Java EE Integration
 JSR-352: Java Batch is included in Java EE 7

Execute

 Provides EE clustering, security, resource
management, etc to Java Batch applications
 Performance benefits to dispatching into longrunning, reusable container
– JIT compilation through the first couple runs
– Eliminates overhead of starting / stopping JVM

28

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
Orchestrate

29

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
 Splits and Flows provide a mechanism for executing
job steps concurrently at the orchestration layer

29

Sunday, September 22, 13

Orchestrate

© 2013 IBM Corporation
Parallel Job Processing
 Splits and Flows provide a mechanism for executing
job steps concurrently at the orchestration layer

Orchestrate

 A flow is a sequence of one or more steps which
execute sequentially, but as a single unit.

29

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
 Splits and Flows provide a mechanism for executing
job steps concurrently at the orchestration layer

Orchestrate

 A flow is a sequence of one or more steps which
execute sequentially, but as a single unit.
 A Split is a collection of flows that may execute
concurrently
– A split may only contain “flows”; a step is not
implicitly a flow

29

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
 Splits and Flows provide a mechanism for executing
job steps concurrently at the orchestration layer

Orchestrate

 A flow is a sequence of one or more steps which
execute sequentially, but as a single unit.
 A Split is a collection of flows that may execute
concurrently
– A split may only contain “flows”; a step is not
implicitly a flow
 This is done entirely in the JSL descriptor
– Imposed on the batch application with no code
changes!

29

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
Implement

30

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
 Step-level parallelism can be achieved programmatically using
step partitioning

30

Sunday, September 22, 13

Implement

© 2013 IBM Corporation
Parallel Job Processing
 Step-level parallelism can be achieved programmatically using
step partitioning

Implement

 A partitioned step runs as multiple instances with distinct
property sets

30

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing
 Step-level parallelism can be achieved programmatically using
step partitioning

Implement

 A partitioned step runs as multiple instances with distinct
property sets
 PartitionMapper defines the number of partitions, and property
values for each partition
– Can be a fixed set of partitions in JSL
– Can be dynamic using a PartitionMapper implementation

30

Sunday, September 22, 13

© 2013 IBM Corporation
Parallel Job Processing

31

Sunday, September 22, 13

© 2013 IBM Corporation
Wrap up...

© 2013 IBM Corporation

Sunday, September 22, 13
Batch Processing
 The oldest “new thing” in Java
 JSR 352 applies the modern thinking and abstraction of Java EE and SOA and applies it to
sequential batch processing
 The standardized programming model provides application developers vendor portability
 Inclusion in Java EE 7 ensures wide spread availability

33

Sunday, September 22, 13

© 2013 IBM Corporation
Questions?
Find this presentation and more!
http://ibm.co/JavaOne2013

© 2013 IBM Corporation

Sunday, September 22, 13

More Related Content

PPT
Was l iberty for java batch and jsr352
PDF
JavaOne2013: Implement a High Level Parallel API - Richard Ning
PDF
JavaOne BOF 5957 Lightning Fast Access to Big Data
ODP
Bci for Beginners
ODP
Windows Debugging Tools - JavaOne 2013
PPTX
JSR 236 Concurrency Utils for EE presentation for JavaOne 2013 (CON7948)
PDF
Inside IBM Java 7
PDF
Java one 2015 [con3339]
Was l iberty for java batch and jsr352
JavaOne2013: Implement a High Level Parallel API - Richard Ning
JavaOne BOF 5957 Lightning Fast Access to Big Data
Bci for Beginners
Windows Debugging Tools - JavaOne 2013
JSR 236 Concurrency Utils for EE presentation for JavaOne 2013 (CON7948)
Inside IBM Java 7
Java one 2015 [con3339]

What's hot (20)

PDF
JavaOne2013: Secure Engineering Practices for Java
PDF
JavaOne 2013 CON7370: Java Interprocess Communication Challenges in Low-Laten...
PDF
What's New in IBM Java 8 SE?
ODP
Debugging Native heap OOM - JavaOne 2013
PPTX
[RakutenTechConf2013] [E-3] Financial Web System with Java EE 6
PDF
SHOW107: The DataSource Session: Take XPages data boldly where no XPages data...
PDF
AD116 XPages Extension Library: Making Application Development Even Easier
PDF
A Java Implementer's Guide to Better Apache Spark Performance
PDF
Java Enterprise Edition 6 Overview
PPTX
Whats Next for JCA?
PPTX
Top 50 java ee 7 best practices [con5669]
PPTX
All of the Performance Tuning Features in Oracle SQL Developer
PPTX
Java vs .Net
PDF
Impact2014: Introduction to the IBM Java Tools
PDF
JavaOne2013: Build Your Own Runtime Monitoring for the IBM JDK with the Healt...
PPT
JavaScript Frameworks and Java EE – A Great Match
PPTX
Image Converter
PDF
JavaOne 2015: From Java Code to Machine Code
PDF
Hibernate Interview Questions
PPTX
Future of Java EE with Java SE 8
JavaOne2013: Secure Engineering Practices for Java
JavaOne 2013 CON7370: Java Interprocess Communication Challenges in Low-Laten...
What's New in IBM Java 8 SE?
Debugging Native heap OOM - JavaOne 2013
[RakutenTechConf2013] [E-3] Financial Web System with Java EE 6
SHOW107: The DataSource Session: Take XPages data boldly where no XPages data...
AD116 XPages Extension Library: Making Application Development Even Easier
A Java Implementer's Guide to Better Apache Spark Performance
Java Enterprise Edition 6 Overview
Whats Next for JCA?
Top 50 java ee 7 best practices [con5669]
All of the Performance Tuning Features in Oracle SQL Developer
Java vs .Net
Impact2014: Introduction to the IBM Java Tools
JavaOne2013: Build Your Own Runtime Monitoring for the IBM JDK with the Healt...
JavaScript Frameworks and Java EE – A Great Match
Image Converter
JavaOne 2015: From Java Code to Machine Code
Hibernate Interview Questions
Future of Java EE with Java SE 8
Ad

Viewers also liked (11)

PDF
Spring Batch - Lessons Learned out of a real life banking system.
ODP
Developing Microservices using Spring - Beginner's Guide
ODP
3978 Why is Java so different... A Session for Cobol/PLI/Assembler Developers
PDF
Spring Batch Performance Tuning
PPT
J2EE Batch Processing
PDF
Design & Develop Batch Applications in Java/JEE
PDF
Three Key Concepts for Understanding JSR-352: Batch Programming for the Java ...
PDF
JPA 2.1 performance tuning tips
PPTX
Spring batch for large enterprises operations
PDF
Java EE 7 Batch processing in the Real World
PPTX
Java EE vs Spring Framework
Spring Batch - Lessons Learned out of a real life banking system.
Developing Microservices using Spring - Beginner's Guide
3978 Why is Java so different... A Session for Cobol/PLI/Assembler Developers
Spring Batch Performance Tuning
J2EE Batch Processing
Design & Develop Batch Applications in Java/JEE
Three Key Concepts for Understanding JSR-352: Batch Programming for the Java ...
JPA 2.1 performance tuning tips
Spring batch for large enterprises operations
Java EE 7 Batch processing in the Real World
Java EE vs Spring Framework
Ad

Similar to Three key concepts for java batch (20)

PDF
Batch Applications for the Java Platform
PDF
Batch Applications for Java Platform 1.0: Java EE 7 and GlassFish
PPTX
PPTX
Batching and Java EE (jdk.io)
PPTX
Hadoop vs Java Batch Processing JSR 352
PDF
Spring batch overivew
PPTX
JavaFX and JEE 7
PPTX
Spring batch
PDF
Atlanta JUG - Integrating Spring Batch and Spring Integration
PDF
Presente e Futuro: Java EE.next()
PPTX
Spring batch
PDF
Core Banking System modernization for Japanese Bank
PPTX
Exploring Data Integration Capabilities of the WSO2 Platform
PDF
F3-DP-2015-Milata-Tomas-java-ee-batch-editor (1)
PDF
AI Industrial Summit - SOFIA, BULGARIA - A High-Speed Data Ingestion Microser...
PPTX
Java EE8 - by Kito Mann
PDF
Join the Java Evolution GIDS Bangalore & Pune
PPT
Dev212 Comparing Net And Java The View From 2006
PDF
AAI 2236-Using the New Java Concurrency Utilities with IBM WebSphere
PDF
AAI-2236 Using the new Java Concurrency Utilities with IBM WebSphere
Batch Applications for the Java Platform
Batch Applications for Java Platform 1.0: Java EE 7 and GlassFish
Batching and Java EE (jdk.io)
Hadoop vs Java Batch Processing JSR 352
Spring batch overivew
JavaFX and JEE 7
Spring batch
Atlanta JUG - Integrating Spring Batch and Spring Integration
Presente e Futuro: Java EE.next()
Spring batch
Core Banking System modernization for Japanese Bank
Exploring Data Integration Capabilities of the WSO2 Platform
F3-DP-2015-Milata-Tomas-java-ee-batch-editor (1)
AI Industrial Summit - SOFIA, BULGARIA - A High-Speed Data Ingestion Microser...
Java EE8 - by Kito Mann
Join the Java Evolution GIDS Bangalore & Pune
Dev212 Comparing Net And Java The View From 2006
AAI 2236-Using the New Java Concurrency Utilities with IBM WebSphere
AAI-2236 Using the new Java Concurrency Utilities with IBM WebSphere

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Big Data Technologies - Introduction.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
1. Introduction to Computer Programming.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Machine learning based COVID-19 study performance prediction
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Approach and Philosophy of On baking technology
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MIND Revenue Release Quarter 2 2025 Press Release
Big Data Technologies - Introduction.pptx
Network Security Unit 5.pdf for BCA BBA.
1. Introduction to Computer Programming.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Group 1 Presentation -Planning and Decision Making .pptx
Advanced methodologies resolving dimensionality complications for autism neur...
“AI and Expert System Decision Support & Business Intelligence Systems”
SOPHOS-XG Firewall Administrator PPT.pptx
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Machine learning based COVID-19 study performance prediction
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
MYSQL Presentation for SQL database connectivity
Assigned Numbers - 2025 - Bluetooth® Document
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Digital-Transformation-Roadmap-for-Companies.pptx

Three key concepts for java batch

  • 1. Timothy C. Fanelli - Senior IT Specialist 23 September 2013 Three Key Concepts for Understanding JSR-352: Batch Applications for the Java Platform © 2013 IBM Corporation Sunday, September 22, 13
  • 2. Important Disclaimers THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY. WHILST EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. ALL PERFORMANCE DATA INCLUDED IN THIS PRESENTATION HAVE BEEN GATHERED IN A CONTROLLED ENVIRONMENT. YOUR OWN TEST RESULTS MAY VARY BASED ON HARDWARE, SOFTWARE OR INFRASTRUCTURE DIFFERENCES. ALL DATA INCLUDED IN THIS PRESENTATION ARE MEANT TO BE USED ONLY AS A GUIDE. IN ADDITION, THE INFORMATION CONTAINED IN THIS PRESENTATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM, WITHOUT NOTICE. IBM AND ITS AFFILIATED COMPANIES SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION. NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, OR SHALL HAVE THE EFFECT OF: - CREATING ANY WARRANT OR REPRESENTATION FROM IBM, ITS AFFILIATED COMPANIES OR ITS OR THEIR SUPPLIERS AND/OR LICENSORS 2 Sunday, September 22, 13 © 2013 IBM Corporation
  • 3. About me  Timothy C. Fanelli  Senior IT Specialist, IBM - Mainframe Workload Modernization  Instructor of Software Engineering Clarkson University, Potsdam NY  [email protected][email protected], [email protected]  Visit the IBM booth #5112 and meet other IBM developers at JavaOne 2013 3 Sunday, September 22, 13 © 2013 IBM Corporation
  • 4. Agenda  Background  Three Key Concepts – Implementation – Orchestration – Execution  An Example JSR 352 Application  Advanced Topics – Splits and Flows – Partitioning – Java EE  Conclusion and Thoughts on What’s Next... 4 Sunday, September 22, 13 © 2013 IBM Corporation
  • 5. Background © 2013 IBM Corporation Sunday, September 22, 13
  • 6. Batch Processing  One of the oldest processing paradigms  Historically associated with mainframe computing  Still incredibly relevant today, with fresh challenges in an OLTP driven world 6 Sunday, September 22, 13 © 2013 IBM Corporation
  • 7. Batch Processing  One of the oldest processing paradigms  Historically associated with mainframe computing  Still incredibly relevant today, with fresh challenges in an OLTP driven world 6 Sunday, September 22, 13 © 2013 IBM Corporation
  • 8. Java for Batch Processing?  Mainframe developers have shied away from Java – Performance concerns over native languages – Integration concerns for legacy data – Disparate developer skill set between System Z and Java 7 Sunday, September 22, 13 © 2013 IBM Corporation
  • 9. Java for Batch Processing?  Mainframe developers have shied away from Java – Performance concerns over native languages – Integration concerns for legacy data – Disparate developer skill set between System Z and Java  Java and JavaEE have dominated the Online Transaction Processing world 7 Sunday, September 22, 13 © 2013 IBM Corporation
  • 10. Java for Batch Processing?  Mainframe developers have shied away from Java – Performance concerns over native languages – Integration concerns for legacy data – Disparate developer skill set between System Z and Java  Java and JavaEE have dominated the Online Transaction Processing world  Time to bridge the two worlds together – IBM Java for zOS, IBM WebSphere, and Spring Batch paved new paths – Just-in-Time Compilation, Garbage Collection optimizations proved it out – Adoption is wide-spread! 7 Sunday, September 22, 13 © 2013 IBM Corporation
  • 11. Java for Batch Processing?  Mainframe developers have shied away from Java – Performance concerns over native languages – Integration concerns for legacy data – Disparate developer skill set between System Z and Java  Java and JavaEE have dominated the Online Transaction Processing world  Time to bridge the two worlds together – IBM Java for zOS, IBM WebSphere, and Spring Batch paved new paths – Just-in-Time Compilation, Garbage Collection optimizations proved it out – Adoption is wide-spread!  Only remaining challenge was the lack of a standard – The need for JSR-352 was obvious 7 Sunday, September 22, 13 © 2013 IBM Corporation
  • 12. JSR 352: Batch Applications for the Java Platform  Expert working group formed 29 November 2011 – IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants – Broad range of talent with deep batch experience 8 Sunday, September 22, 13 © 2013 IBM Corporation
  • 13. JSR 352: Batch Applications for the Java Platform  Expert working group formed 29 November 2011 – IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants – Broad range of talent with deep batch experience  Final Release 24 May 2013 8 Sunday, September 22, 13 © 2013 IBM Corporation
  • 14. JSR 352: Batch Applications for the Java Platform  Expert working group formed 29 November 2011 – IBM*, VMWare, RedHat, Oracle, Credit Suisse, Independent participants – Broad range of talent with deep batch experience  Final Release 24 May 2013  Included in Java EE 7! 8 Sunday, September 22, 13 © 2013 IBM Corporation
  • 15. Three Key Concepts... © 2013 IBM Corporation Sunday, September 22, 13
  • 16. Three Key Concepts ...  JSR 352 defines – Implementation: A programming model for implementing the artifacts – Orchestration: A Job Specification Language, which orchestrates the execution of a batch artifacts within a job. – Execution: A runtime environment for executing batch application, according to a defined lifecycle. Orchestrate Implement Execute 10 Sunday, September 22, 13 © 2013 IBM Corporation
  • 17. Three Key Concepts ...  JSR 352 defines – Implementation: A programming model for implementing the artifacts – Orchestration: A Job Specification Language, which orchestrates the execution of a batch artifacts within a job. – Execution: A runtime environment for executing batch application, according to a defined lifecycle. Orchestrate Implement Execute  Note: “key” concepts, not “new” concepts! – Roles and abstractions should be familiar to SOA and JavaEE developers 10 Sunday, September 22, 13 © 2013 IBM Corporation
  • 18. Anatomy of JSR352  Those concepts define the anatomy of JSR 352: Batch Applications for the Java Platform... 11 Sunday, September 22, 13 © 2013 IBM Corporation
  • 19. Anatomy of JSR352  Those concepts define the anatomy of JSR 352: Batch Applications for the Java Platform... Orchestrate Implement Execute 11 Sunday, September 22, 13 © 2013 IBM Corporation
  • 20. Anatomy of JSR352  Those concepts define the anatomy of JSR 352: Batch Applications for the Java Platform... Listeners Contexts Listeners Partitioning Batchlet Orchestrate Implement Job Operator Job Reader Step Processor Chunk Writer Execute Job Repository Chunk Chunk Chunk Listeners 11 Sunday, September 22, 13 © 2013 IBM Corporation
  • 21. Implementation: The programming model  Chunk and Batchlet provide models for implementing a step. Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 12 Sunday, September 22, 13 © 2013 IBM Corporation
  • 22. Implementation: The programming model  Chunk and Batchlet provide models for implementing a step.  Contexts provide Job- and Step- level runtime information, and provide interim data persistence. Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 12 Sunday, September 22, 13 © 2013 IBM Corporation
  • 23. Implementation: The programming model  Chunk and Batchlet provide models for implementing a step.  Contexts provide Job- and Step- level runtime information, and provide interim data persistence.  Listeners provide callback hooks to respond to lifecycle events on batch artifacts. Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 12 Sunday, September 22, 13 © 2013 IBM Corporation
  • 24. Implementation: The programming model  Chunk and Batchlet provide models for implementing a step.  Contexts provide Job- and Step- level runtime information, and provide interim data persistence.  Listeners provide callback hooks to respond to lifecycle events on batch artifacts. Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk  Partitioning provides a mechanism imposing parallel processing on jobs and steps Writer Chunk Chunk Chunk Listeners 12 Sunday, September 22, 13 © 2013 IBM Corporation
  • 25. Implementation: The programming model Chunk vs Batchlet Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 13 Sunday, September 22, 13 © 2013 IBM Corporation
  • 26. Implementation: The programming model Chunk vs Batchlet  Both are implementations of a step within a batch job Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 13 Sunday, September 22, 13 © 2013 IBM Corporation
  • 27. Implementation: The programming model Chunk vs Batchlet  Both are implementations of a step within a batch job  The chunk model – Encapsulates a very common pattern: ETL – Single “reader”, “processor” and “writer” – Reader/Processor combination is invoked until an entire “chunk” of data is processed – Output “chunk” is written atomically Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners 13 Sunday, September 22, 13 © 2013 IBM Corporation
  • 28. Implementation: The programming model Chunk vs Batchlet  Both are implementations of a step within a batch job  The chunk model – Encapsulates a very common pattern: ETL – Single “reader”, “processor” and “writer” – Reader/Processor combination is invoked until an entire “chunk” of data is processed – Output “chunk” is written atomically  Batchlet provides a “roll your own” step type – Invoked and runs to completion, producing a return code upon exit. 13 Sunday, September 22, 13 Listeners Contexts Listeners Partitioning Batchlet Reader Processor Chunk Writer Chunk Chunk Chunk Listeners © 2013 IBM Corporation
  • 29. Orchestration: The Job Specification Language (JSL)  The JSL defines a batch job as an XML document Job 14 Sunday, September 22, 13 Step © 2013 IBM Corporation
  • 30. Orchestration: The Job Specification Language (JSL)  The JSL defines a batch job as an XML document  Describes a step as an assemblage of batch artifacts 14 Sunday, September 22, 13 Job Step © 2013 IBM Corporation
  • 31. Orchestration: The Job Specification Language (JSL)  The JSL defines a batch job as an XML document  Describes a step as an assemblage of batch artifacts Job Step  Provides for the description of steps, step groupings, and execution sequencing 14 Sunday, September 22, 13 © 2013 IBM Corporation
  • 32. Execution: The JobOperator and Repository  JobOperator is the runtime interface for job management, including start, stop, restart and job repository related commands Job Operator Job Repository 15 Sunday, September 22, 13 © 2013 IBM Corporation
  • 33. Execution: The JobOperator and Repository  JobOperator is the runtime interface for job management, including start, stop, restart and job repository related commands  The Job Repository holds information about completed and executing jobs 15 Sunday, September 22, 13 Job Operator Job Repository © 2013 IBM Corporation
  • 34. Execution: The JobOperator and Repository  JobOperator is the runtime interface for job management, including start, stop, restart and job repository related commands  The Job Repository holds information about completed and executing jobs Job Operator Job Repository  To start a batch job, get a JobOperator instance use it to start a job described (described by JSL). 15 Sunday, September 22, 13 © 2013 IBM Corporation
  • 35. Execution: JobInstance, JobExecution, and StepExecution Job Step * JobInstance * JobExecution * * StepExecution Job Operator 16 Sunday, September 22, 13 Job Repository © 2013 IBM Corporation
  • 36. Execution: JobInstance, JobExecution, and StepExecution  The state of a job is broken down into various parts, and persisted in the repository – Submitting a job creates a JobInstance, a logical representation of a particular “run” of a job. – A JobExecution is a single attempt to run a JobInstance. A restart attempt creates another JobExecution – Similarly, a StepExecution is a single attempt to run a step within a job. It is created when a step starts execution. Job Operator 16 Sunday, September 22, 13 Job Step * JobInstance * JobExecution * * StepExecution Job Repository © 2013 IBM Corporation
  • 37. An Example JSR 352 Application © 2013 IBM Corporation Sunday, September 22, 13
  • 38. The Application 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 39. The Application ‣ A typical “batch hello world”: 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 40. The Application ‣ A typical “batch hello world”: – Reads strings from an input file 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 41. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 42. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 43. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file ‣ Key capabilities 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 44. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file ‣ Key capabilities – If something goes wrong, we don’t want to discard all the prior work; and we want to pick up where we left off 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 45. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file ‣ Key capabilities – If something goes wrong, we don’t want to discard all the prior work; and we want to pick up where we left off – We want control over the transaction scoping so prevent lock contention in high volume periods 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 46. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file ‣ Key capabilities – If something goes wrong, we don’t want to discard all the prior work; and we want to pick up where we left off – We want control over the transaction scoping so prevent lock contention in high volume periods – We want flexibility to “plug and play” where our records come from 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 47. The Application ‣ A typical “batch hello world”: – Reads strings from an input file – Performs some validation or transforms – Writes validated or transformed string to an output file ‣ Key capabilities – If something goes wrong, we don’t want to discard all the prior work; and we want to pick up where we left off – We want control over the transaction scoping so prevent lock contention in high volume periods – We want flexibility to “plug and play” where our records come from – For unit testing, development testing, and QA testing records may come from a variety of sources 18 Sunday, September 22, 13 © 2013 IBM Corporation
  • 48. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 49. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 50. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model – Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 51. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model – Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces – Loosely coupled artifacts will be orchestrated into a single-step job later 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 52. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model – Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces – Loosely coupled artifacts will be orchestrated into a single-step job later – “Free” checkpoint/restart capability 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 53. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model – Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces – Loosely coupled artifacts will be orchestrated into a single-step job later – “Free” checkpoint/restart capability – Transaction scoping imposed externally in the job descriptor 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 54. The Design ‣ Let’s implement a string-transform in an extract-transform-load pattern ‣ We’ll use JSR352’s Chunk programming model – Encapsulates the ETL pattern components as Reader, Processor, and Writer interfaces – Loosely coupled artifacts will be orchestrated into a single-step job later – “Free” checkpoint/restart capability – Transaction scoping imposed externally in the job descriptor ‣ Job will be executed as a Java SE command line batch application 19 Sunday, September 22, 13 © 2013 IBM Corporation
  • 55. The Code Implement 20 Sunday, September 22, 13 © 2013 IBM Corporation
  • 56. The Code Implement ‣ An ItemReader encapsulates the data access and deserialization of a record. 20 Sunday, September 22, 13 © 2013 IBM Corporation
  • 57. The Code Implement ‣ An ItemReader encapsulates the data access and deserialization of a record. ‣ No restriction on data access paradigm: use DAO patterns, JDBC, JPA, Hibernate, Spring Data, etc! 20 Sunday, September 22, 13 © 2013 IBM Corporation
  • 58. The Code Implement ‣ An ItemReader encapsulates the data access and deserialization of a record. ‣ No restriction on data access paradigm: use DAO patterns, JDBC, JPA, Hibernate, Spring Data, etc! ‣ Checkpoint/Restart data provided as Serializable argument to “open” and from “checkpointInfo” methods. 20 Sunday, September 22, 13 © 2013 IBM Corporation
  • 59. The Code Implement 21 Sunday, September 22, 13 © 2013 IBM Corporation
  • 60. The Code Implement  An ItemWriter is the output counterpart to ItemReader 21 Sunday, September 22, 13 © 2013 IBM Corporation
  • 61. The Code Implement  An ItemWriter is the output counterpart to ItemReader  Primary difference is that writeItems accepts a “chunk” of output objects (as a list) to serialize. 21 Sunday, September 22, 13 © 2013 IBM Corporation
  • 62. The Code Implement  An ItemWriter is the output counterpart to ItemReader  Primary difference is that writeItems accepts a “chunk” of output objects (as a list) to serialize.  Again, no restriction on data access paradigm! 21 Sunday, September 22, 13 © 2013 IBM Corporation
  • 63. The Code Implement 22 Sunday, September 22, 13 © 2013 IBM Corporation
  • 64. The Code Implement  An ItemProcessor encapsulates the business logic applied to each record 22 Sunday, September 22, 13 © 2013 IBM Corporation
  • 65. The Code Implement  An ItemProcessor encapsulates the business logic applied to each record  “main” here demonstrates the invocation of a batch job, using the JobOperator 22 Sunday, September 22, 13 © 2013 IBM Corporation
  • 66. The Code Implement  An ItemProcessor encapsulates the business logic applied to each record  “main” here demonstrates the invocation of a batch job, using the JobOperator  Would typically not be in the processor implementation 22 Sunday, September 22, 13 © 2013 IBM Corporation
  • 67. The Batch Descriptor and Job Specification Orchestrate 23 Sunday, September 22, 13 © 2013 IBM Corporation
  • 68. The Batch Descriptor and Job Specification Orchestrate  batch.xml defines and names the batch artifacts in this application archive 23 Sunday, September 22, 13 © 2013 IBM Corporation
  • 69. The Batch Descriptor and Job Specification Orchestrate  batch.xml defines and names the batch artifacts in this application archive  sample.xml is an example Job Specification Language document for SampleBatchApp 23 Sunday, September 22, 13 © 2013 IBM Corporation
  • 70. The Execution Execute 24 Sunday, September 22, 13 © 2013 IBM Corporation
  • 71. The Execution Execute  Package the application as a standard JAR or WAR for deployment in JavaSE or EE environments – batch.xml goes in META-INF or WEB-INF/ classes/META-INF – JSL may go in META-INF/batch-jobs, or submitted from an external source (up to the provider!) 24 Sunday, September 22, 13 © 2013 IBM Corporation
  • 72. The Execution Execute 25 Sunday, September 22, 13 © 2013 IBM Corporation
  • 73. The Execution  See it live? 25 Sunday, September 22, 13 Execute © 2013 IBM Corporation
  • 74. Advanced Topics © 2013 IBM Corporation Sunday, September 22, 13
  • 75. Job Management - Restart, Stop, Abandon Execute 27 Sunday, September 22, 13 © 2013 IBM Corporation
  • 76. Job Management - Restart, Stop, Abandon  Had something gone wrong, what then? – The “main” program shown was too simple... only “started” the job 27 Sunday, September 22, 13 Execute © 2013 IBM Corporation
  • 77. Job Management - Restart, Stop, Abandon  Had something gone wrong, what then? – The “main” program shown was too simple... only “started” the job Execute  JobOperator exposes APIs for a variety of job management tasks: start, stop, abandon, restart – Would have had to take advantage of these for advanced job management capabilities. 27 Sunday, September 22, 13 © 2013 IBM Corporation
  • 78. Job Management - Restart, Stop, Abandon  Had something gone wrong, what then? – The “main” program shown was too simple... only “started” the job Execute  JobOperator exposes APIs for a variety of job management tasks: start, stop, abandon, restart – Would have had to take advantage of these for advanced job management capabilities.  The door is left open for more advanced batch job management systems to be built! – Integration into existing enterprise schedulers? – New Java EE batch scheduling standard? – Plenty of options, but currently left to the provider to implement 27 Sunday, September 22, 13 © 2013 IBM Corporation
  • 79. Java EE Integration Execute 28 Sunday, September 22, 13 © 2013 IBM Corporation
  • 80. Java EE Integration  JSR-352: Java Batch is included in Java EE 7 28 Sunday, September 22, 13 Execute © 2013 IBM Corporation
  • 81. Java EE Integration  JSR-352: Java Batch is included in Java EE 7 Execute  Provides EE clustering, security, resource management, etc to Java Batch applications 28 Sunday, September 22, 13 © 2013 IBM Corporation
  • 82. Java EE Integration  JSR-352: Java Batch is included in Java EE 7 Execute  Provides EE clustering, security, resource management, etc to Java Batch applications  Performance benefits to dispatching into longrunning, reusable container – JIT compilation through the first couple runs – Eliminates overhead of starting / stopping JVM 28 Sunday, September 22, 13 © 2013 IBM Corporation
  • 83. Parallel Job Processing Orchestrate 29 Sunday, September 22, 13 © 2013 IBM Corporation
  • 84. Parallel Job Processing  Splits and Flows provide a mechanism for executing job steps concurrently at the orchestration layer 29 Sunday, September 22, 13 Orchestrate © 2013 IBM Corporation
  • 85. Parallel Job Processing  Splits and Flows provide a mechanism for executing job steps concurrently at the orchestration layer Orchestrate  A flow is a sequence of one or more steps which execute sequentially, but as a single unit. 29 Sunday, September 22, 13 © 2013 IBM Corporation
  • 86. Parallel Job Processing  Splits and Flows provide a mechanism for executing job steps concurrently at the orchestration layer Orchestrate  A flow is a sequence of one or more steps which execute sequentially, but as a single unit.  A Split is a collection of flows that may execute concurrently – A split may only contain “flows”; a step is not implicitly a flow 29 Sunday, September 22, 13 © 2013 IBM Corporation
  • 87. Parallel Job Processing  Splits and Flows provide a mechanism for executing job steps concurrently at the orchestration layer Orchestrate  A flow is a sequence of one or more steps which execute sequentially, but as a single unit.  A Split is a collection of flows that may execute concurrently – A split may only contain “flows”; a step is not implicitly a flow  This is done entirely in the JSL descriptor – Imposed on the batch application with no code changes! 29 Sunday, September 22, 13 © 2013 IBM Corporation
  • 88. Parallel Job Processing Implement 30 Sunday, September 22, 13 © 2013 IBM Corporation
  • 89. Parallel Job Processing  Step-level parallelism can be achieved programmatically using step partitioning 30 Sunday, September 22, 13 Implement © 2013 IBM Corporation
  • 90. Parallel Job Processing  Step-level parallelism can be achieved programmatically using step partitioning Implement  A partitioned step runs as multiple instances with distinct property sets 30 Sunday, September 22, 13 © 2013 IBM Corporation
  • 91. Parallel Job Processing  Step-level parallelism can be achieved programmatically using step partitioning Implement  A partitioned step runs as multiple instances with distinct property sets  PartitionMapper defines the number of partitions, and property values for each partition – Can be a fixed set of partitions in JSL – Can be dynamic using a PartitionMapper implementation 30 Sunday, September 22, 13 © 2013 IBM Corporation
  • 92. Parallel Job Processing 31 Sunday, September 22, 13 © 2013 IBM Corporation
  • 93. Wrap up... © 2013 IBM Corporation Sunday, September 22, 13
  • 94. Batch Processing  The oldest “new thing” in Java  JSR 352 applies the modern thinking and abstraction of Java EE and SOA and applies it to sequential batch processing  The standardized programming model provides application developers vendor portability  Inclusion in Java EE 7 ensures wide spread availability 33 Sunday, September 22, 13 © 2013 IBM Corporation
  • 95. Questions? Find this presentation and more! http://ibm.co/JavaOne2013 © 2013 IBM Corporation Sunday, September 22, 13