Concurrent kernel execution using streams

How can I launch different kernels concurrently using streams? Please guide me, as I am new to CUDA.
Thanks in advance.

Study these:

  1. The "Asynchronous Concurrent Execution" section of the programming guide:

Programming Guide :: CUDA Toolkit Documentation

  2. The concurrent kernels CUDA sample code:

[url]http://docs.nvidia.com/cuda/cuda-samples/index.html#concurrent-kernels[/url]
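
To give a rough idea of what those two references describe, here is a minimal sketch of two different kernels issued into two separate non-default streams. The names scaleKernel, offsetKernel, and runTwoKernels are made up for this illustration and do not come from the sample or the guide. Putting the kernels in different streams only permits them to overlap; whether they actually run concurrently depends on the device and on how many resources each kernel uses, so treat this as a starting point rather than a guarantee.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Simplified error handling for the sketch: bail out on the first failure.
// (Resources already allocated are not released on this path.)
#define CHECK(call) do { if ((call) != cudaSuccess) return false; } while (0)

// Two hypothetical kernels that do different work on different buffers.
__global__ void scaleKernel(float *data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

__global__ void offsetKernel(float *data, int n, float offset) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += offset;
}

// Issues each kernel into its own stream and verifies the results.
bool runTwoKernels(int n) {
    size_t bytes = static_cast<size_t>(n) * sizeof(float);
    float *hA, *hB, *dA, *dB;

    // Pinned host memory lets cudaMemcpyAsync be truly asynchronous.
    CHECK(cudaMallocHost(&hA, bytes));
    CHECK(cudaMallocHost(&hB, bytes));
    CHECK(cudaMalloc(&dA, bytes));
    CHECK(cudaMalloc(&dB, bytes));
    for (int i = 0; i < n; ++i) { hA[i] = 1.0f; hB[i] = 1.0f; }

    // One non-default stream per independent piece of work.
    cudaStream_t s1, s2;
    CHECK(cudaStreamCreate(&s1));
    CHECK(cudaStreamCreate(&s2));

    int block = 256;
    int grid = (n + block - 1) / block;

    // Work within one stream runs in order; work in different
    // streams is allowed (not forced) to overlap.
    CHECK(cudaMemcpyAsync(dA, hA, bytes, cudaMemcpyHostToDevice, s1));
    CHECK(cudaMemcpyAsync(dB, hB, bytes, cudaMemcpyHostToDevice, s2));

    // The fourth launch parameter selects the stream.
    scaleKernel<<<grid, block, 0, s1>>>(dA, n, 2.0f);
    offsetKernel<<<grid, block, 0, s2>>>(dB, n, 3.0f);
    CHECK(cudaGetLastError());

    CHECK(cudaMemcpyAsync(hA, dA, bytes, cudaMemcpyDeviceToHost, s1));
    CHECK(cudaMemcpyAsync(hB, dB, bytes, cudaMemcpyDeviceToHost, s2));

    // Wait for both streams before reading the results on the host.
    CHECK(cudaStreamSynchronize(s1));
    CHECK(cudaStreamSynchronize(s2));

    bool ok = true;
    for (int i = 0; i < n; ++i) {
        if (hA[i] != 2.0f || hB[i] != 4.0f) { ok = false; break; }
    }

    CHECK(cudaStreamDestroy(s1));
    CHECK(cudaStreamDestroy(s2));
    CHECK(cudaFree(dA));
    CHECK(cudaFree(dB));
    CHECK(cudaFreeHost(hA));
    CHECK(cudaFreeHost(hB));
    return ok;
}

int main() {
    bool ok = runTwoKernels(1 << 16);
    printf("%s\n", ok ? "ok" : "failed");
    return ok ? 0 : 1;
}
```

Compile it with nvcc and look at the timeline in a profiler to see whether the two kernels really overlap. Kernels launched into the default stream (no stream argument) will not overlap this way by default, which is why both launches name an explicit stream.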