Monday, October 01, 2012

Useful Oracle Queries

The following query shows the last SQL text executed by an inactive session:


select sess.sid,
       sess.serial#,
       sess.username,
       sql_text
 from v$sqlarea sqlarea, v$session sess
 where sess.prev_hash_value = sqlarea.hash_value
   and sess.prev_sql_addr  = sqlarea.address
   and sess.username is not null
   and sess.status='INACTIVE';   
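
For comparison, a minimal variation of the same join (a sketch, assuming the standard V$SESSION columns SQL_ADDRESS and SQL_HASH_VALUE) shows the statement an active session is currently executing:

select sess.sid,
       sess.serial#,
       sess.username,
       sqlarea.sql_text
  from v$sqlarea sqlarea, v$session sess
 where sess.sql_hash_value = sqlarea.hash_value
   and sess.sql_address    = sqlarea.address
   and sess.username is not null
   and sess.status = 'ACTIVE';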


The following query shows which session (SID) is blocking which session (SID):

select blocking_session,
       sid,
       serial#,
       wait_class,
       seconds_in_wait
  from v$session
 where blocking_session is not null
 order by blocking_session;
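
A slightly extended sketch of the same idea (same V$ views, plus an outer join to V$SQLAREA; adjust the column choices as needed) lists each blocked session next to its blocker, together with the SQL text the blocked session is stuck on:

select blocker.sid        as blocking_sid,
       blocked.sid        as blocked_sid,
       blocked.serial#,
       blocked.wait_class,
       blocked.seconds_in_wait,
       sqlarea.sql_text   as blocked_sql
  from v$session blocked, v$session blocker, v$sqlarea sqlarea
 where blocked.blocking_session = blocker.sid
   and blocked.sql_address      = sqlarea.address(+)
   and blocked.sql_hash_value   = sqlarea.hash_value(+)
 order by blocker.sid;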



Friday, September 21, 2012

Icebergs in the Clouds: The Other Risks of Cloud Computing

This is an interesting series of problems raised by Bryan Ford from Yale University.

In the paper he discusses some of the interesting problems with Cloud Computing in general, summarized as follows:

  • Side channels
  • Stability Risks from Interactive Services
  • Risks of hidden failure correlation (cross-layer robustness)
  • The always-connected assumption
  • Digital Preservation Risks
The overall aim is making the cloud more reliable by anticipating these risks.

Here is the link to the paper: https://www.usenix.org/system/files/conference/hotcloud12/hotcloud12-final54.pdf

The seven deadly sins of cloud computing research

The paper was presented at HotCloud 2012 by Malte Schwarzkopf, the main author and a PhD student at Cambridge, with Steven Hand and Derek Murray as co-authors.

The paper is about seven mistakes that researchers commit when dealing with the cloud and with data processing in the cloud. The list is the following:

  • Unnecessary distributed parallelism 
  • Assuming performance homogeneity
  • Picking the low hanging fruit
  • Forcing the abstraction
  • Unrepresentative workload
  • Assuming perfect elasticity
  • Ignoring fault tolerance
What seems to be the case with all seven arguments is that the paper particularly focuses on parallel algorithms applicable to the cloud, mostly ones benefiting from MapReduce.

The paper is heavily performance oriented, and at least the first three points, which the author also presented in the talk, come with performance comparisons; those are the ones that gave him nice graphs for the slides. Some of the interesting points are that machines may get overloaded and that the workloads used in evaluations need to be more representative. But I think there is nothing very new about the paper; these are already known facts re-iterated here. Still, it is good that he highlighted some issues with papers that are already being published. Maybe others will try to avoid the mistakes mentioned in the paper.

You can tell it is work from the early years of this guy's PhD.

Link to the paper: https://www.usenix.org/system/files/conference/hotcloud12/hotcloud12-final70.pdf

Wednesday, March 14, 2012

Parallel Symbolic Execution for Automated Real-World Software Testing


This paper introduces Cloud9, a platform for automated testing of real-world software. Our main contribution is the scalable parallelization of symbolic execution on clusters of commodity hardware, to help cope with path explosion. Cloud9 provides a systematic interface for writing “symbolic tests” that concisely specify entire families of inputs and behaviors to be tested, thus improving testing productivity. Cloud9 can handle not only single-threaded programs but also multi-threaded and distributed systems. It includes a new symbolic environment model that is the first to support all major aspects of the POSIX interface, such as processes, threads, synchronization, networking, IPC, and file I/O. We show that Cloud9 can automatically test real systems, like memcached, Apache httpd, lighttpd, the Python interpreter, rsync, and curl. We show how Cloud9 can use existing test suites to generate new test cases that capture untested corner cases (e.g., network stream fragmentation). Cloud9 can also diagnose incomplete bug fixes by analyzing the difference between buggy paths before and after a patch.

Sunday, February 26, 2012

Designing Distributed Database Systems for Efficient Operation

Designing Distributed Database Systems for Efficient Operation

Sangkyu Rho, Salvatore March

Distributed database systems can yield significant cost and performance advantages over centralized systems for geographically distributed organizations. The efficiency of a distributed database depends primarily on the data allocation (data replication and placement) and the operating strategies (where and how retrieval and update query processing operations are performed). We develop a distributed database design approach that comprehensively treats data allocation and operating strategies, explicitly modeling their interdependencies for both retrieval and update processing. We demonstrate that data replication, join node selection, and data reduction by semijoin are important design and operating decisions that have significant impact on both the cost and response time of a distributed database system.
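
As a rough illustration of what data reduction by semijoin means in practice (a sketch with hypothetical tables ORDERS at site A and CUSTOMERS at site B, not taken from the paper): only the join column of CUSTOMERS travels to site A, the ORDERS rows that cannot match are dropped there, and only the reduced ORDERS relation travels back for the full join.

-- Step 1 (site B): project the join column and ship it to site A.
create table cust_ids_shipped as
  select cust_id from customers;

-- Step 2 (site A): the semijoin: keep only ORDERS rows that will find a match.
create table orders_reduced as
  select o.*
    from orders o
   where o.cust_id in (select s.cust_id from cust_ids_shipped s);

-- Step 3 (site B): ship ORDERS_REDUCED back and perform the full join locally.
select c.*, r.*
  from customers c, orders_reduced r
 where c.cust_id = r.cust_id;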

Tuesday, February 21, 2012

On the impact of network latency on distributed systems design

link: http://www.springerlink.com/content/v8x37536u2343u86/fulltext.pdf

Research in distributed database systems to date has assumed a “variable cost” model of network response time. However, network response time has two components: transmission time (variable with message size) and latency (fixed). This research improves on existing models by incorporating a “fixed plus variable cost” model of network response time. In this research, we: (1) develop a distributed database design approach that incorporates a “fixed plus variable cost” network response time function; (2) run a set of experiments to create designs using this model; and (3) evaluate the impact the new model has on the design in various types of networks.
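
In other words, instead of a purely size-proportional cost, the network response time is modeled roughly as response_time = fixed_latency + message_size / bandwidth (my shorthand for a "fixed plus variable cost" function; the paper's exact formulation may differ).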

This is a followup paper by the same author:
Modeling Network Latency and Parallel Processing in Distributed Database Design

Sunday, February 19, 2012

Thialfi: A Client Notification Service for Internet-Scale Applications

http://sigops.org/sosp/sosp11/current/2011-Cascais/printable/10-adya.pdf

Abstract. Ensuring the freshness of client data is a fundamental problem for
applications that rely on cloud infrastructure to store data and mediate sharing. Thialfi is a notification service developed at Google to simplify this task. Thialfi supports applications written in multiple programming languages and running on multiple platforms, e.g., browsers, phones, and desktops. Applications register their interest in a set of shared objects and receive notifications when those objects change. Thialfi servers run in multiple Google data centers for availability and replicate their state asynchronously. Thialfi’s approach to recovery emphasizes simplicity: all server state is soft, and clients drive recovery and assist in replication. A principal goal of our design is to provide a straightforward API and good semantics despite a variety of failures, including server crashes, communication failures, storage unavailability, and data center failures.

Some notes: Version updates come from the application server, and as a result the application server has to maintain the new version number for the objects it shares with the clients. This implies that Thialfi imposes a requirement to version objects, whereas in many cases versioning may not even be needed.

They consider only 10% of the clients to be online most of the time, and measurements are taken for cases where the number of clients is not really high; an increase in the number of clients would change the metrics (drastically?).

Friday, February 17, 2012

Some interesting graph cut papers

Notes on graph cuts with submodular edge weights
http://users.cms.caltech.edu/~krausea/discml/papers/jegelka09subcuts.pdf

Optimization on Graphs with Variable
http://www.springerlink.com/content/g457x624814gm812/fulltext.pdf

Labelings of Graphs with Fixed and Variable Edge-Weights
http://epubs.siam.org/sidma/resource/1/sjdmec/v21/i3/p688_s1

Dynamic Graph Cuts for Efficient Inference in Markov Random Fields
http://research.microsoft.com/en-us/um/people/pkohli/papers/pami07.pdf

Some thoughts on Replicability

Replicability
  • Horizontal partitioning is where the cloud is most significantly helpful, so if you manage to identify stateless components, or come up with recommendations for stateless components, it is easier to tell whether we can do any horizontal partitioning without significant effects on, and modifications to, the application (see the sketch after this list).
  • Data is an issue, so let's collocate code and data
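
As a small, hypothetical sketch of what horizontal partitioning can look like at the data layer (table and column names are made up, not tied to any particular application):

-- Rows are spread across partitions by a hash of the user id, so each
-- partition holds an independent horizontal slice of the data that can
-- be placed and scaled separately.
create table user_events (
  event_id   number,
  user_id    number,
  created_at date,
  payload    varchar2(4000)
)
partition by hash (user_id) partitions 4;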

Graph Partitioning with Natural Cuts

link: http://research.microsoft.com/pubs/142349/punchtr.pdf
Daniel Delling, Andrew Goldberg, Ilya Razenshteyn, Renato F. Werneck

Abstract. We present a novel approach to graph partitioning based on the notion of natural cuts. Our algorithm, called PUNCH, has two phases. The first phase performs a series of minimum-cut computations to identify and contract dense regions of the graph. This reduces the graph size significantly, but preserves its general structure. The second phase uses a combination of greedy and local search heuristics to assemble the final partition. The algorithm performs especially well on road networks, which have an abundance of natural cuts (such as bridges, mountain passes, and ferries). In a few minutes, it obtains the best known partitions for continental-sized networks, significantly improving on previous results.

Thursday, February 16, 2012

A New Approach to the Minimum Cut Problem

http://www.columbia.edu/~cs2035/courses/ieor6614.S09/Contraction.pdf

Friday, February 10, 2012

An Evaluation of Alternative Architectures for Transaction Processing in the Cloud

Donald Kossmann, Tim Kraska, Simon Loesing, SIGMOD 2010

http://www.cs.berkeley.edu/~kraska/pub/sigmod10-cloudbench.pdf

Cloud computing promises a number of advantages for the deployment of data-intensive applications. One important promise is reduced cost with a pay-as-you-go business model. Another promise is (virtually) unlimited throughput by adding servers if the workload increases. This paper lists alternative architectures to effect cloud computing for database applications and reports on the results of a comprehensive evaluation of existing commercial cloud services that have adopted these architectures. The focus of this work is on transaction processing (i.e., read and update workloads), rather than analytics or OLAP workloads, which have recently gained a great deal of attention. The results are surprising in several ways. Most importantly, it seems that all major vendors have adopted a different architecture for their cloud services. As a result, the cost and performance of the services vary significantly depending on the workload.

Wednesday, February 08, 2012

To Move or Not to Move: the economics of Cloud Computing

To Move or Not to Move: The Economics of Cloud Computing
Byung Chul Tak, Bhuvan Urgaonkar, and Anand Sivasubramaniam, The Pennsylvania State University
HotCloud 2011

http://www.usenix.org/events/hotcloud11/tech/final_files/Tak.pdf


Cloud-based hosting promises cost advantages over conventional in-house (on-premise) application deployment. One important question when considering a move to the cloud is whether it makes sense for 'my' application to migrate to the cloud. This question is challenging to answer for the following reasons. Although many potential benefits of migrating to the cloud can be enumerated, some benefits may not apply to 'my' application. Also, there can be multiple ways in which an application might make use of the facilities offered by cloud providers. Answering these questions requires an in-depth understanding of the cost implications of all the possible choices specific to 'my' circumstances. In this study we identify an initial set of key factors affecting the costs of a deployment choice. Using benchmarks representing two different applications (TPC-W and TPC-E), we investigate the evolution of costs for different deployment choices. We show that application characteristics such as workload intensity, growth rate, storage capacity and software licensing costs produce a complex combined effect on overall costs. We also discuss issues regarding workload variance and horizontal partitioning.

Tuesday, January 31, 2012

Applying graph partitioning methods in measurement-based dynamic load balancing [PPL Technical Report 2012]

Applying graph partitioning methods in measurement-based dynamic load balancing
[PPL Technical Report 2012]

Here is the link to the paper: http://charm.cs.illinois.edu/newPapers/12-03/paper.pdf

As noted, it is a technical report by people from the University of Illinois.

[SUMMARY]