Journal of Tsinghua University (Science and Technology)  2022, Vol. 62 Issue (12): 1851-1863    DOI: 10.16511/j.cnki.qhdxxb.2022.21.024
As-Stream: An intelligent operator parallelization strategy for fluctuating data streams
LI Wei1,2, LI Chenglong3, YANG Jiahai3
1. Information Technology Center, Tsinghua University, Beijing 100084, China;
2. China University of Geosciences (Beijing), Beijing 100083, China;
3. Institute for Network Science and Cyberspace, Tsinghua University, Beijing 100084, China
Abstract: Many studies have proposed optimizing the processing of fluctuating data streams through online resource management, but they have overlooked optimizing operator parallelism at the streaming application level. In Apache Storm, for example, operator parallelism cannot be adjusted dynamically once it has been set. This paper presents As-Stream, an intelligent operator parallelization strategy for fluctuating data streams that significantly improves the performance of the stream computing platform. The method tunes parameters in real time within an elastic intelligent monitoring module, based on unsupervised learning and self-adaptive analysis. As-Stream comprises algorithms for parallel bottleneck identification, parameter plan generation, parameter migration conversion, and parameter migration scheduling. The system was implemented on the Apache Storm platform and tested extensively in a real distributed stream computing environment. The results show that As-Stream significantly outperforms the existing default scheduling strategies: with sufficient resources, the average throughput increases 2.4-fold, and with limited resources, the average latency is reduced by 44%.
Key words: stream computing    machine learning    operator parallelism    resource allocation
Received: 2021-12-30      Published: 2022-11-10
Funding: LI Chenglong, associate research fellow,
LI Wei, LI Chenglong, YANG Jiahai. As-Stream: An intelligent operator parallelization strategy for fluctuating data streams[J]. Journal of Tsinghua University (Science and Technology), 2022, 62(12): 1851-1863.
