Reducing the traffic bottleneck in cloud-based data management

Student thesis: Master Thesis and HD Thesis

  • Alexandros Fragkopoulos
In an era of abundant networking and computing power, there are many ways to connect to the rest of the world and offer one’s services and products. Cloud computing is currently one of the leading approaches to owning computing and storage resources without actually building them. But things are not always that simple. Businesses have reasons to think carefully before migrating to the cloud: they may have their own infrastructure (perhaps of lower performance and capacity) and not want to waste it, they may still consider the cloud insecure, or they may not trust the availability of cloud infrastructure. This project proposes a mixed solution that combines cloud resources with private on-premises resources. The idea is based on the concept that a company needs additional resources (provided it has some of its own) when the network becomes overloaded, which is why a combined, hybrid scheme makes more sense. This solution, however, is already provided by cloud bursting, which is a kind of hybrid cloud. Another problem often faced in cloud computing is the creation of various kinds of bottlenecks, which mean not only a loss of performance but also a loss of money for customers. With these problems as incentives, the project goes one step further and proposes a scheme that includes a randomized routing algorithm, which makes different use of the cloud provider’s data centers in the greater area around the company, and an on-premises SDN network, which is responsible for selecting (if there are many) the applications that have to migrate to the cloud for a limited amount of time (during the application’s peak hours of the day) to offload the internal network. The aim is to distribute traffic across different data centers and decentralize it.
This project’s use case is the migration of an e-commerce website. The company uses its own infrastructure most of the time, has complete control over its data and applications, and uses a pay-per-use charging model for cloud services. Since the project introduces a randomized routing algorithm, it is important to mention that this algorithm tries to address the bottleneck (i.e. a storage I/O bottleneck) that is created when many clients are served from one data center (probably because it is located near an industrial zone) while other data centers stay under-utilized. The algorithm randomly chooses a data center that is within the limitations of the cloud provider’s data centers and can accommodate the enterprise’s data and instances. The data is then replicated just before the load has to be redirected to the cloud. Using an on-premises SDN network also benefits the cloud provider, which can inform the local SDN controller about the available data centers that can be included in the list for random discovery. Thus, data centers that are overloaded or experiencing malfunctions can be excluded from the process. Last but not least, the proposed framework enables cloud providers to set an additional fee (in bandwidth price) when the chosen data center is close to the enterprise.
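The selection step described above can be illustrated with a minimal sketch. It is not the thesis's implementation: the `DataCenter` fields, the function names, and the uniform random choice are all assumptions made for illustration, since the abstract only states that a data center is chosen randomly among those that can accommodate the enterprise's data and instances, with overloaded or malfunctioning ones excluded.

```python
import random
from dataclasses import dataclass


@dataclass
class DataCenter:
    # Hypothetical record of what the provider reports to the local SDN controller.
    name: str
    free_storage_gb: float
    free_instances: int
    healthy: bool = True  # False if the data center is overloaded or malfunctioning


def eligible_data_centers(data_centers, storage_gb, instances):
    """Keep only data centers that are healthy and can host the data and instances."""
    return [
        dc for dc in data_centers
        if dc.healthy
        and dc.free_storage_gb >= storage_gb
        and dc.free_instances >= instances
    ]


def choose_data_center(data_centers, storage_gb, instances, rng=None):
    """Pick one eligible data center uniformly at random (assumed distribution).

    Returns None when no data center can accommodate the request.
    """
    rng = rng or random.Random()
    candidates = eligible_data_centers(data_centers, storage_gb, instances)
    if not candidates:
        return None
    return rng.choice(candidates)


if __name__ == "__main__":
    # Made-up example list; names and capacities are illustrative only.
    available = [
        DataCenter("dc-north", free_storage_gb=500, free_instances=10),
        DataCenter("dc-south", free_storage_gb=50, free_instances=10),
        DataCenter("dc-east", free_storage_gb=500, free_instances=10, healthy=False),
    ]
    target = choose_data_center(available, storage_gb=100, instances=4)
    print(target.name if target else "no eligible data center")
```

Filtering before drawing keeps the random choice restricted to data centers that can actually serve the load, so spreading traffic does not come at the cost of picking an unusable target.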
Specialisation: Telecom Infrastructure
Publication date: 6 Jun 2013
Number of pages: 56
ID: 77024188