中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Reducing Tail Latency of Interactive Multi-tier Workloads in the Cloud

文献类型:会议论文

作者Kejiang Ye; Cheng-Zhong Xu
出版日期2018
会议日期2018
会议地点英国
英文摘要Reducing tail latency becomes increasingly important to improve user-perceived service experience. User-facing latency-sensitive cloud applications typically contain multiple interactive tiers running in different virtual machines (VMs) with complex interaction patterns. Consolidation of those applications is a challenge. In this paper we study the consolidation of multi-tier interactive workloads from a new perspective of user-perceived tail latency. We propose a novel profiling-based consolidation methodology. The objective is to satisfy tail latency while reducing the number of physical machines. We consider two key factors that affecting the tail latency of multi-tier workloads: interference with neighboring VMs and interaction between different tiers. We model the consolidation of multi-tier workloads as an optimization problem with different objectives and constraints. We implement and evaluate the proposed models, as well as comparing with other methods (i.e., without profiling or without considering interaction influence). Experimental results show that the proposed method is able to greatly reduce the tail latency compared with the traditional consolidation method.
语种英语
URL标识查看原文
源URL[http://ir.siat.ac.cn:8080/handle/172644/14122]  
专题深圳先进技术研究院_数字所
推荐引用方式
GB/T 7714
Kejiang Ye,Cheng-Zhong Xu. Reducing Tail Latency of Interactive Multi-tier Workloads in the Cloud[C]. 见:. 英国. 2018.

入库方式: OAI收割

来源:深圳先进技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。