All Products
Search
Document Center

Realtime Compute for Apache Flink:What is Alibaba Cloud Realtime Compute for Apache Flink?

Last Updated:Jun 20, 2025

Alibaba Cloud Realtime Compute for Apache Flink is an end-to-end real-time big data analytics platform built on Apache Flink. It processes data with sub-second response times. The platform simplifies business development by using standard SQL statements to help enterprises transform their operations into real-time and intelligent big data computing.

Overview

Alibaba Cloud Realtime Compute for Apache Flink is a fully managed serverless service built on Apache Flink. It supports multiple billing methods and requires no setup. The platform provides an end-to-end development, operation and maintenance, and management environment. It delivers powerful capabilities for entire project lifecycles, including draft development, data debugging, operation and monitoring, Autopilot, and intelligent diagnostics. Alibaba Cloud Realtime Compute for Apache Flink is fully compatible with Apache Flink. It features an enhanced enterprise-class Flink engine that delivers twice the performance of Apache Flink, allowing you to seamlessly migrate your tasks to the cloud. The service provides enterprise-class value-added features, such as Flink Change Data Capture (CDC) and complex event processing (CEP), along with various built-in upstream and downstream connectors to help enterprises build efficient, stable, and powerful real-time data applications.

Comparison with open source Apache Flink

Compared with open source Flink, Realtime Compute for Apache Flink offers significant advantages in functional extensions, performance optimization, and enterprise application support.

Type

Open source Flink

Realtime Compute for Apache Flink

Cloud advantages

Performance and cost

  • No built-in elastic scaling capability

  • Resource utilization depends on manual tuning

  • Through SQL operator optimization and self-developed state storage kernel Gemini, performance reaches twice that of open source Flink in Nexmark stream computing tests.

  • Supports Autopilot, solving various performance issues such as insufficient job throughput, backpressure in the entire pipeline, and resource waste. It automatically monitors and adjusts job resource allocation without manual intervention.

  • Supports fine-grained resource configuration at the operator level (CPU/Mem), improving resource utilization by 100% for large-scale jobs.

  • More flexible payment models, supporting Subscription, Pay-as-you-go or Hybrid billing, with CU-level billing.

Flexible payment models and better performance help enterprises reduce costs. CU-level intelligent scaling improves resource utilization

Compatibility and integration capabilities

  • Native Flink SQL and DataStream API

  • Requires manual integration of components such as MySQL, Kafka, Paimon, with frequent version updates potentially causing compatibility issues

  • Fully compatible with mainstream Flink APIs (SQL, Java, Python, YAML).

  • Provides 30+ ready-to-use connectors (such as MySQL, Kafka, Hologres, Paimon), supporting various storage types including databases, message middleware, data warehouses, lake formats, and file systems.

  • Allows registration of custom connectors to interface with external systems as needed.

  • Supports multiple Flink versions coexisting, enabling seamless migration and upgrades.

Lowers entry barriers, improves ecosystem integration efficiency, and ensures smooth business migration

Development efficiency and debugging experience

  • Lacks an end-to-end development management platform

  • Limited debugging tools

  • Supports whole database synchronization, database and table sharding synchronization, table schema change synchronization, and integrated full read from replica and incremental read from master (CDAS/CTAS).

  • Multi-version job management (code comparison and rollback), supporting integration with remote Git repositories (such as GitHub, GitLab, or Gitee).

  • Built-in native Flink functions, supporting custom function management and use; provides more than 20 Flink SQL templates for common scenarios.

  • Supports test data management, quick run debugging, intermediate result display, development-production isolation, and integrates with VS Code local development tool.

Reduces development difficulty, decreases debugging and testing costs, improves job deployment speed and quality

Operations and management capabilities

  • No built-in comprehensive monitoring and alerting system

  • Lacks graphical operation interface

  • Manual scaling, complex resource scheduling

Significantly reduces O&M costs and optimization difficulty. Fine-grained resource management substantially reduces costs. Improves job observability and response efficiency

Stability and reliability

  • Cluster deployment has regional limitations

  • Fault tolerance mechanism depends on user configuration

  • Same-city high availability architecture, multiple regions available, ensuring business stability.

  • Full-chain automatic fault tolerance capability, supporting JobManager high availability, system without single points of failure, more stable.

  • Supports system checkpoint and job snapshot lifecycle management, with state compatibility checking and data migration capabilities, maximizing reuse of existing state data.

Ensures stable operation of large-scale jobs, meeting enterprise-level production environment requirements

Enterprise-level service capabilities

  • Relies on community documentation, forums, and other unofficial support

  • No dedicated technical support team

  • Provides 24/7 professional technical support, backed by in-house employee maintenance services, with 99.9% SLA guarantee.

  • Rapid response mechanism, supporting customized requirements.

  • Continuous update and maintenance plan, providing long-term version support (LTS).

Obtains professional, trustworthy technical assurance, accelerating problem resolution and business deployment

Security and access control

  • Basic authentication mechanisms (such as Kerberos)

  • Permission control requires integration with external systems

  • Interoperable with Alibaba Cloud account system, supporting fine-grained RBAC (Role-Based Access Control).

  • Supports tenant-level and project-level resource and code isolation, meeting cross-team collaboration requirements.

  • Implements secure key management through variable management, reducing security risks from plaintext key exposure.

  • Comprehensive operation audit records all production changes.

Builds unified identity authentication system, ensuring data asset security and compliance

Extensibility and multi-ecosystem integration

  • Can extend functionality through plugin mechanisms

  • Ecosystem integration depends on community or developer maintenance

  • Supports emerging scenarios such as AI and intelligent data analytics.

  • Deep integration with data lakes (Iceberg, Hudi), data warehouses (ClickHouse, Hologres, MaxCompute).

  • Provides SDK and OpenAPI support for secondary development.

Creates a flexible, extensible unified real-time platform supporting diverse business scenarios

Billing

Realtime Compute for Apache Flink provides two billable items: management resources and computing resources. It supports the following billing methods:

  • Subscription: This billing method allows you to purchase resources for a specified period of time before you can use them.

  • Pay-as-you-go: This billing method allows you to use and release resources on demand. You are charged based on the number of resources that you use.

  • Hybrid billing: Configure elastic computing resources based on the subscription model, with elastic resources billed in pay-as-you-go mode.

For more information about billing rules, see Billing.

Usage

You can log on to the Realtime Compute for Apache Flink console to use this service.

Related background knowledge