Wednesday, June 18, 2008

A Rendezvous of Content Adaptable Service and Product Line Modeling

Seo Jeong Lee and Soo Dong Kim -- PROFES 2005

They propose a service decision modeling technique for content adaptable applications

Michael Dertouzos [8] envisioned four fundamental forces in pervasive computing:
1- Natural Interaction
2- Automation
3- Individualized information access
4- Collaboration


A taxonomy of variability is given in the paper (figure not reproduced here).

The content adaptable service decision process
  1. Define System Architecture
    1. embrace contextual change
    2. embrace ad hoc composition
    3. recognize sharing as the default
  2. Define the variation points and variants
    1. Context is profile of network, device, user, service
    2. for each of the above profiles we may think of a variation point
  3. Define the dependencies between variation points
  4. Define the dependencies between variants
  5. Define the strategy of negotiation
    1. It depends on the domain, service, and application
    2. The decision value of the strategy should be one_of or in_the_range_of variant values.
  6. Select the adequate algorithm or module
    1. A QoS algorithm or something similar can be used to choose the required set of components and requirements based on the information that is fed to the system by the system designer; a toy sketch of such a decision step follows below.
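
To make the last two steps concrete, here is a minimal Python sketch of such a decision step. The variation point, variant values, and profile name are hypothetical, not taken from the paper; it only illustrates the one_of / in_the_range_of decision values mentioned above.

# Hypothetical sketch of a variation-point decision step (names are illustrative,
# not from Lee & Kim's paper). A variation point carries a set of variants, and a
# negotiation strategy picks a value that is either one_of the variants or
# in_the_range_of the variant values.

from dataclasses import dataclass

@dataclass
class VariationPoint:
    name: str        # e.g. "image_resolution", tied to a device profile
    variants: list   # allowed variant values

def one_of(vp: VariationPoint, requested):
    """Accept the requested value only if it is one of the declared variants."""
    return requested if requested in vp.variants else None

def in_the_range_of(vp: VariationPoint, requested):
    """Clamp a numeric request into the range spanned by the variants."""
    lo, hi = min(vp.variants), max(vp.variants)
    return max(lo, min(hi, requested))

# Device-profile variation point: supported image widths for a content service.
resolution = VariationPoint("image_resolution", [320, 640, 1024])

print(one_of(resolution, 640))            # 640 (valid variant)
print(one_of(resolution, 800))            # None (not a declared variant)
print(in_the_range_of(resolution, 2048))  # 1024 (clamped into the variant range)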

Tuesday, June 17, 2008

Synergy between Software Product Line and Intelligent Mobile Middleware

Weishan Zhang and Klaus Marius Hansen -- 2007

Current mobile middleware is designed based on a "one-size-fits-all" paradigm, lacking flexibility for optimization, customization, and adaptation.

They use frame-based techniques and XVCL (XML-based Variant Configuration Language) to define and configure points of variability.

Reference [4] of this paper seems interesting to read.

They consider two major problems with the current mobile middleware applications:
  1. Monolithic structure: Specialized optimization and customization might be required
  2. Ontology evolution has not been addressed in the current ontology based middleware
They use a service-oriented architecture to connect the different pieces of their services together. This actually imposes a performance overhead on the system, which may considerably degrade its execution.

  • Configuration is done as early as possible
  • Frame based ontology management and aggregation mechanism can run both on J2ME and J2SE
  • Ontology evolution is more than the management of the ontology itself
  • Flexible template capabilities for XVCL
They use RacerPro as their main means of reasoning over the ontology.

Frame-based Ontology_Java Processing (FOJP)
  • Bridging the OWL ontologies to Java classes by providing mappings
  • Management and handling of ontology evolution
  • Managing the update of agent definition, including the agent belief, goals, actions, and plans
A context ontology is divided into two parts: the parts that change frequently and the parts that stay more or less the same for a longer period of time. XVCL commands are then used in a meta-ontology to bridge these concepts and provide an aggregation of all these classes of ontologies.
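
A rough sketch of this split-and-aggregate idea, in Python rather than XVCL (the paper works on XML ontologies with XVCL commands; the class names and merge logic below are invented for illustration):

# Rough Python analogue of the frame-based idea (not XVCL syntax). The stable part
# of the context ontology sits in a base frame, the frequently changing parts live
# in small variant frames, and a meta-ontology "adapts" the variants into the base
# to produce the aggregated ontology actually used at runtime.

stable_frame = {
    "Person": ["name"],
    "Device": ["id", "owner"],
}

# Frequently changing fragment, e.g. context properties added for a new deployment.
variant_frame = {
    "Device": ["batteryLevel", "signalStrength"],
    "Location": ["latitude", "longitude"],
}

def adapt(base: dict, variant: dict) -> dict:
    """Merge a variant frame into the base frame (the meta-ontology step)."""
    merged = {cls: list(props) for cls, props in base.items()}
    for cls, props in variant.items():
        existing = merged.setdefault(cls, [])
        for p in props:
            if p not in existing:
                existing.append(p)
    return merged

aggregated = adapt(stable_frame, variant_frame)
print(aggregated["Device"])   # ['id', 'owner', 'batteryLevel', 'signalStrength']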

Ontology evolution involves two phases
  1. meta-ontology development
  2. development of the other meta-artifacts for the mobile middleware, including the code components

Monday, June 16, 2008

Supporting Pluggable Configuration Algorithms in PCOM

Marcus Handte, Klaus Herrmann, Gregor Schiele, Christian Becker

The authors have defined the initial definition of PCOM in [1]

Devices have component containers that manage the hosted components on the device. Functionalities are offered as contracts in terms of interfaces. A contract can also specify resource requirements that must be met in order to use a component. For applications there is an application anchor, which is possibly the starting (root) component of an application.

configuration algorithms control the chaining of components.

The goals for PCOM are
  1. Resilience to failure
  2. Efficiency & minimalization
  3. Simplicity & Extensibility
In the new design the container is broken into parts
  1. the application manager: starts the anchor, but whenever reconfiguration is needed it restarts the application from the very beginning, which is quite stupid
  2. assembler: implements the functionality of computing valid configurations. Assembler can launch different configuration algorithms depending on the situation.
  3. component containers: actually the providers of components for the other two parts of the system.
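
A hedged sketch of how such a pluggable assembler could be structured; the interface and the greedy algorithm below are illustrative only and not the actual PCOM API:

# Sketch of the "pluggable assembler" idea: the assembler exposes one interface
# for computing a valid configuration, and concrete algorithms (greedy,
# exhaustive, ...) can be swapped in depending on the situation.

from abc import ABC, abstractmethod

class Assembler(ABC):
    @abstractmethod
    def configure(self, anchor, containers):
        """Return a valid component configuration for the application anchor."""

class GreedyAssembler(Assembler):
    def configure(self, anchor, containers):
        # Resolve each dependency with the first container offering a matching contract.
        config = {}
        for dep in anchor["requires"]:
            for c in containers:
                if dep in c["offers"]:
                    config[dep] = c["name"]
                    break
        return config

class ApplicationManager:
    """Starts the anchor and asks the currently plugged-in assembler for a configuration."""
    def __init__(self, assembler: Assembler):
        self.assembler = assembler

    def start(self, anchor, containers):
        return self.assembler.configure(anchor, containers)

anchor = {"requires": ["display", "storage"]}
containers = [{"name": "phone", "offers": ["display"]},
              {"name": "laptop", "offers": ["storage", "display"]}]

print(ApplicationManager(GreedyAssembler()).start(anchor, containers))
# {'display': 'phone', 'storage': 'laptop'}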

Appliance Data Services: Making Steps Towards an Appliance Computing World

Andrew Huang, Benjamin Ling, John Barton, Armando Fox

The paper introduces two main dilemmas in using devices:
  1. They are more complex
  2. There are too many features
The vision of the paper: "An appliance computing world is one in which people move data effortlessly among artifacts to accomplish a variety of tasks"

The paper introduces a set of principles and attributes for any ADS system
  • A1: People move data using concrete syntax, like "Post the picture to my wall"
  • P1: Bring devices to the forefront (in contrast to Mark Weiser's vision of computers and devices disappearing invisibly into the physical infrastructure)
  • A2: Devices are simple, single-purpose appliances: This is not true, because users have shown acceptance of devices with more complex capabilities. For example, turning cellphones into cameras is not something users have rejected.
  • P2: Keep the number of user-controllable features on devices to a minimum: This should be correct, as it provides better manipulation and control over the device and allows simpler user interfaces. In the end, the device shouldn't be too complicated for the user to use.
  • A3: People perform a variety of traditional tasks, as well as a new set of advanced tasks, with their devices. The functionality to perform high-level tasks can be placed on users' PCs but kept hidden from the user.
  • P3: Place the software required to accomplish tasks in the network infrastructure
Their implementation of the ADS system sends requests as tuples (userid, command-tag, data), with userid and command-tag used for the following purposes:
  • Application Selection
  • Access Control
  • Other service features
They have three parts to the architecture
  1. Data Receive Stage
    1. Role: Deals with device heterogeneity
    2. It handles all the device connection requirements but is very poor for scalability. It becomes a single point of failure for the system as well.
    3. It relies on a stateless Access Point (what "stateless" means here I don't really know) and an aggregator that enables extensibility of the Access Point by adding new device features
    4. The aggregator is actually the point of conflict, since that is where all the integration between the access points and the required input data for the application control stage happens.
  2. Application Control Stage
    1. The data is collected to create a chain of components that satisfies the application. It is not clear how this set of data is monitored to satisfy the requirements of the applications and components, or how others should be made aware of these requirements when developing components.
    2. Command Canonicalizer
      1. Allows having simple user interfaces
    3. Template Database
      1. Minimizing device configuration
    4. Dataflow Manager
      1. Coordinates data input by the user: how is this required data specified?
  3. Service Execution
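
A toy sketch of how the (userid, command-tag, data) tuple could drive application selection and access control; the command tags, ACL, and handler functions are made up, not from the paper:

# Dispatching an ADS request tuple (userid, command-tag, data): the command tag
# selects the application, and the userid is checked against an access-control list.

APPLICATIONS = {
    "post-to-wall": lambda user, data: f"posted {len(data)} bytes for {user}",
    "print":        lambda user, data: f"queued {len(data)} bytes to printer",
}

ACL = {
    "alice": {"post-to-wall", "print"},
    "bob":   {"print"},
}

def handle_request(userid: str, command_tag: str, data: bytes) -> str:
    # Access control: the user must be allowed to issue this command.
    if command_tag not in ACL.get(userid, set()):
        return "denied"
    # Application selection: the command tag picks the service to run.
    app = APPLICATIONS.get(command_tag)
    return app(userid, data) if app else "unknown command"

print(handle_request("alice", "post-to-wall", b"\x89PNG..."))   # dispatched to the handler
print(handle_request("bob", "post-to-wall", b"\x89PNG..."))     # denied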

Sunday, June 08, 2008

A Reflective Framework for Discovery and Interaction in Heterogeneous Mobile Environments

Grace, P., Blair, G.S., Samuel, S.: A reflective framework for discovery and interaction in heterogeneous mobile environments. SIGMOBILE Mob. Comput. Commun. Rev. 9 (2005) 2-14.

a component is “a unit of composition with contractually specified interfaces, which can be
deployed independently and is subject to third party creation” [14].

Three layers
  • concrete middleware section
    • binding framework
    • service discovery framework
  • abstract middleware-programming model
  • abstract to concrete mapping
lookup operation across different discovery protocols.

Problem: How to find which discovery protocol is in use?
  1. Having a fixed point of agreement
    1. Not all protocols can guarantee to use this technology.
    2. The higher level mechanisms may change
  2. The approach that they promote is Cycle and See
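
A minimal sketch of the Cycle and See idea: with no fixed point of agreement, just cycle through the pluggable discovery protocols and see which one answers. The probe functions and plugin list are invented; this is not the actual framework interface:

# Cycle through the available discovery plug-ins and treat the first protocol
# that returns results as the one in use in the current environment.

def probe_slp():   return []          # pretend SLP finds nothing here
def probe_upnp():  return ["media-renderer", "printer"]

DISCOVERY_PLUGINS = [("SLP", probe_slp), ("UPnP", probe_upnp)]

def cycle_and_see():
    for name, probe in DISCOVERY_PLUGINS:
        services = probe()
        if services:
            return name, services      # first protocol that answers is "in use"
    return None, []

print(cycle_and_see())   # ('UPnP', ['media-renderer', 'printer'])
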
Interesting component design for OpenCOM

Toward Wide Area Interaction with Ubiquitous Computing Environments

The overall idea: to unify abstractions exposed by existing ubicomp systems to provide a coarse-grained interface for application interfacing.

Two impediments to wider deployment of ubicomp environments
  1. existing systems support users and applications only within single administrative or user domains
  2. lack of a shared model for ubiquitous computing
The considered model for the initial version of the web-service-based middleware:
  • Environment Model
    • Through service discovery
    • Through a component that handles more complex models of the environment
    • Related aspects
      • Environment State
      • Environment Meta-state
      • Environment Implementation link: the set of software components
        • Event sources
        • Context sources
        • Services
        • Entity Handler
  • Entities
  • Context
    • Values
    • High level inferred context
  • Services
  • Entity relationships
  • Events
  • Data or content
Environment profiles: to provide semantic enrichment
  • entities
  • services
  • context
  • events
  • content
--------------------
Thoughts:
The paper proposes a bottom-up integration of the functionalities of middleware services with the requirements of an environment. The objects in an environment are classified as discussed and the relations between them are established. Based on the requirements of users, rules are defined in the form of Jena rules that can extract the concepts of integration from ontologies and identify which components can be used for which services. The ontology preserves the relationships between the entities, their contexts, and the components.

The reasoner then identifies the set of appropriate components that have to be composed in order to provide the right combination for the request of the environment to be processed.
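
A plain-Python caricature of that rule-based matching (the paper uses Jena rules over an ontology; the facts and the rule below are invented, and this is not Jena syntax):

# Facts relate components to the services they realize and the context they need;
# a "rule" selects the components whose relationships satisfy a requested service.

FACTS = {
    ("LampActuator", "provides", "lighting"),
    ("LampActuator", "needs", "room-location"),
    ("GPSSource",    "provides", "room-location"),
}

def components_for(service: str):
    """Rule: pick a provider of the service plus providers of everything it needs."""
    selected = set()
    for comp, rel, obj in FACTS:
        if rel == "provides" and obj == service:
            selected.add(comp)
            needs = {o for c, r, o in FACTS if c == comp and r == "needs"}
            for need in needs:
                selected |= {c for c, r, o in FACTS if r == "provides" and o == need}
    return selected

print(components_for("lighting"))   # {'LampActuator', 'GPSSource'} (set order may vary)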

The problem with their approach is that they have chosen a bottom-up approach to bind the components to the concepts of user needs. This makes the whole design very dependent on the way the composition has been defined in the ontology; if a relationship between the components changes, the whole design loses its validity and the whole ontology needs to be changed.

On the other hand, this doesn't provide any possibility for component reuse, because the design is bottom-up, which means the components drive the design as opposed to the design driving the components. So it is not possible for the modules to be reused; instead the whole system has to be replaced, making its scalability absolutely questionable.

Furthermore, for each new system a new integration model must be defined, and thus a whole rework at the level of system design also has to be done. So this new architecture doesn't solve the problem of adaptability to a new domain; it just makes it possible for different systems in different domains to choose the same technology to connect to an environment. That is not the role of a broker though, is it?

Sunday, March 16, 2008

FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment

A. Adya, W. J. Bolosky, M. Castro, R. Chaiken, G. Cermak, J. R. Douceur, J. Howell, J. R. Lorch, M. Theimer, R. P. Wattenhofer, "FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment", 5th OSDI, Dec 2002. http://citeseer.ist.psu.edu/adya02farsite.html

------------------

Farsite: secure, scalable file system

logically functions as a centralized file server but physically distributed among a set of untrusted computers

Randomized Replication => availability
cryptographic techniques => secrecy of file content (confidentiality)
Byzantine-fault-tolerant => integrity

scalable => distributed hint mechanism
high performance => locally caching data, lazily propagating file updates, varying the duration and granularity
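
As an illustration of the randomized replication point only (FARSITE's real placement and relocation logic is more involved), a minimal sketch with an invented replication factor:

# Pick R random machines to hold replicas of each file: a geographically or
# administratively localized failure is then unlikely to take out all replicas.

import random

MACHINES = [f"m{i}" for i in range(100)]
R = 4   # replication factor (illustrative value, not FARSITE's)

def place_replicas(file_id: str, machines=MACHINES, r=R):
    rng = random.Random(file_id)   # deterministic per file, just for this sketch
    return rng.sample(machines, r)

print(place_replicas("/docs/report.txt"))   # four machines chosen to hold this file's replicas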

Farsite:
  • central file server
    • shared namespace
    • location-transparent access
    • reliable data storage
  • local desktop filesystems
    • low cost
    • privacy from nosy sysadmins
    • resistance to geographically localized faults

Security is provided as a matter of the virtual security of cryptography, randomized replication, and Byzantine fault tolerance.

The goal: harness the collective resources of loosely coupled insecure and unreliable machines to provide logically centralized secure and reliable file storage service.

cryptography and replication to preserve confidentiality and integrity

Directory metadata is relatively small, and it must be comprehensible and revisable directly by the system; Byzantine fault tolerance is used for this.

Farsite's intended workload and machine characteristics are those observed on desktop machines.
Workload:
  1. high access locality
  2. low persistent update rate
  3. a pattern of read/write sharing that is sequential

Machine characteristics:
  1. high fail-stop rate
  2. low but significant rate of malicious or opportunistic subversion

Administration in Farsite is a matter of configuring a minimal system, authenticating new users and machines, and signing certificates.

Farsite is intended to run on desktop workstations, ~10^5 machines, none of which are dedicated servers, connected by a high-bandwidth, low-latency network whose topology can be ignored.

Fundamental technology trends for Farsite:
  1. a general increase in unused disk capacity (disk capacity is increasing at a faster rate than disk usage, which enables replication for reliability)
  2. a decrease in the computational cost of cryptographic operations (which enables distributed security)

The system allows the flexibility of multiple roots each of which can be regarded as the name of a virtual file server that is collaboratively created by the participating machines.

The security of any distributed system is an issue of managing trust.

The security components that rely on redundancy need to trust that an apparently distinct set of machines is truly distinct and not a single malicious machine pretending to be many => Sybil Attack

The certificates:
  1. namespace certificate: associating the root with a set of machines managing the root metadata
  2. user certificate: associating a user with his personal public key so that his identity can be validated
  3. machine certificate: associating a machine with its own public key to establish the validity of a machine

Machine certificates in Farsite are not signed directly by CAs but rather by users whose certificates designate them as authorized to certify machines.

A user's private key is encrypted with a symmetric key and then stored in a globally readable directory in Farsite. The CA private keys are kept offline because the entire security of Farsite depends on their secrecy.
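
A hedged sketch of that delegation chain: the CA signs user certificates, and a user authorized to certify machines signs machine certificates. The verify() helper and the certificate fields are stand-ins, not FARSITE's actual data structures or a real signature scheme:

# Validate a machine certificate by walking the delegation chain: CA vouches for
# the user, the user's certificate authorizes machine certification, and the user
# vouches for the machine. verify() is a placeholder for a real signature check.

def verify(signer_key: str, cert: dict) -> bool:
    # Stand-in for a real signature check: the cert just records who signed it.
    return cert.get("signed_by") == signer_key

CA_KEY = "ca-public-key"

user_cert = {"subject": "alice", "key": "alice-key",
             "can_certify_machines": True, "signed_by": CA_KEY}
machine_cert = {"subject": "machine-42", "key": "machine-42-key",
                "signed_by": "alice-key"}

def machine_is_valid(machine_cert, user_cert, ca_key=CA_KEY) -> bool:
    return (verify(ca_key, user_cert)                    # CA vouches for the user
            and user_cert["can_certify_machines"]        # user may certify machines
            and verify(user_cert["key"], machine_cert))  # user vouches for the machine

print(machine_is_valid(machine_cert, user_cert))   # True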

Each machine in Farsite may play three roles:
  1. client: a machine that directly interacts with the user
  2. directory group: a set of machines that collectively manage file information
  3. file host




Automating Product-Line Variant Selection for Mobile Devices

White, J., Schmidt, D. C., Wuchner, E., and Nechypurenko, A. 2007. Automating Product-Line Variant Selection for Mobile Devices. In Proceedings of the 11th international Software Product Line Conference (September 10 - 14, 2007). International Conference on Software Product Line. IEEE Computer Society, Washington, DC, 129-140. DOI= http://dx.doi.org/10.1109/SPLC.2007.12

PLAs are a promising approach to help developers manage the complexity of variability between mobile devices.

PLAs can be retargeted for different requirement sets by leveraging common capabilities, patterns, and architectural styles.

The design of a PLA is typically guided by the Scope, Commonality, and Variability (SCV) [7].

With the large array of device types and rapid development speed of new devices and capabilities, the system will not be able to know about all device types a priori.

The problems with the existing component-based and feature-based models are the following:
  • lack of ability to consider resource consumption constraints, such as the consumed memory
  • An appropriate architecture for how a device discovery service would be used to characterize a device's nonfunctional requirements (OS, RAM, etc.)
  • Fast feature selection speed to help with dynamic software delivery for mobile devices
Contributions by the paper:
  • Scatter’s graphical requirement and resource specification mechanisms and show how they facilitate the capture and analysis of a wide variety of requirement types
  • how Scatter transforms requirement specifications into a format that can be operated on by a constraint solver
  • the automated variant selection engine, based on a Constraint Logic Programming Finite Domain (CLP(FD)) solver
  • how PLA constraints impact variant selection time for a constraint-based variant selection engine.
  • PLA design rules that we have gleaned from our experiments that help to improve variant selection time when using a constraint-based approach.
The three key challenges associated with creating automated variant selector in pervasive environments
  • Unknown device signatures (to respond to devices with different capabilities)
  • Variant Cost Optimization (the cost associated with the selected variants should be examined before orchestration, selection, and composition of the variants)
  • Limited selection time ( The time for selecting the appropriate set of variants should be reasonable compared to the time that the user is going to be available in a context where s/he needs the type of service)
In traditional PLA, software developers decide about the set of variants to be selected, configured, and organized to work together.

In pervasive environments there are two problems with manual component selection:
  • The target device signatures are not known ahead of time
  • variant selection must be done on demand
The proposed solution is to capture a formal model of the PLA's commonalities and variabilities so that automation can take place, along with a model of non-functional requirements to prevent deploying components on systems where they would fail due to inconsistencies with the underlying infrastructure.
Scatter has the following features
  • A graphical modeling tool that defines a domain-specific modeling language to visually model the components, their dependencies and composition rules, and the non-functional requirements of each component
  • A compiler to convert the graphical notation to a Prolog knowledge base and a CSP
  • A remoting mechanism to a device discovery service that communicates the discovered devices to Scatter's variant selection engine
  • A variant selection engine based on a Prolog constraint solver that selects a correct and optimal variant for a product
A key challenge in pervasive environments is that variant selection must take into account requirements based on business and context data.

At one extreme, a tool can limit the types of constraints that can be solved to a small subset that is considered most important. At the other extreme, a tool can allow developers to capture any type of constraint, but provide no guarantee of having a way of deducing a variant that satisfies them.

The strategy is to allow the datasources to change while the types of constraints remain constant.

The types of constraints, as they classify them:
  • Software Stack on the device
  • Resource consumption constraints
  • hardware capability constraints
  • business/location-based constraints
What does this mean? The restrictions imposed by the specification format are only on the types of comparisons that can be done and not on the data that the comparison is based upon.

A SOAP-based Web service and a CORBA remoting mechanism are used for remotely communicating device characteristics as they are discovered. (Key, Value) pairs form the reports to Scatter. (How does the device know that it should provide this information in order to get the component it is looking for? There should presumably be another agent installed on the device that is able to report this information.)

A rule is specified that only allows a component to be deployed on a device, if for every local nonfunctional requirement on the component, a resource is present that satisfies the requirement.
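
A small sketch of that deployment rule in Python (Scatter itself encodes it in a Prolog knowledge base; the requirement and resource names here are invented):

# A component may be deployed on a device only if, for every nonfunctional
# requirement on the component, the device reports a resource that satisfies it.

component_requirements = {"RAM_MB": 32, "OS": "SymbianOS"}
device_resources       = {"RAM_MB": 64, "OS": "SymbianOS", "CPU_MHz": 200}

def deployable(requirements: dict, resources: dict) -> bool:
    for key, needed in requirements.items():
        have = resources.get(key)
        if have is None:
            return False
        if isinstance(needed, (int, float)):
            if have < needed:          # numeric requirements are lower bounds
                return False
        elif have != needed:           # symbolic requirements must match exactly
            return False
    return True

print(deployable(component_requirements, device_resources))   # True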

A CSP is a problem that involves finding a labeling (a set of values) for a set of variables that adhere to a set of labeling rules (constraints).

A variant becomes a binary string where the ith position represents if the ith component is present.
  • Nonfunctional requirements. Components with mismatched nonfunctional requirements are completely eliminated from the chain of composition.
  • Prune using low-granularity requirements. Rely on the footprints that various classes of variants provide
  • Limit resource tightness. Filter out unessential resource consumptive components
  • Create service classes: annotating the components based on the class that they are required to be selected from. The more non-functional requirements, the quicker a decision maker can find the required components it is looking for.
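
To illustrate the binary-string encoding mentioned above, here is a toy brute-force selector in Python. Scatter uses a real CLP(FD) solver; the component data, memory budget, and exclusion constraint below are invented:

# A variant is a binary string whose i-th bit says whether the i-th component is
# selected; constraints prune invalid strings and an objective ranks the rest.

from itertools import product

components = [                     # (name, memory cost in KB, value)
    ("codec-basic", 120, 1),
    ("codec-hq",    400, 3),
    ("ui-touch",    200, 2),
]
MEMORY_BUDGET = 500
EXCLUDES = {("codec-basic", "codec-hq")}   # mutually exclusive variants

def valid(bits):
    chosen = [c for c, bit in zip(components, bits) if bit]
    names = {c[0] for c in chosen}
    if any(a in names and b in names for a, b in EXCLUDES):
        return False
    return sum(c[1] for c in chosen) <= MEMORY_BUDGET

best = max((bits for bits in product([0, 1], repeat=len(components)) if valid(bits)),
           key=lambda bits: sum(c[2] for c, bit in zip(components, bits) if bit))
print(best)   # (0, 1, 0): the high-quality codec alone maximizes value within the budget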
Resource constraints are a key requirement type in mobile devices with limited capabilities.

The whole approach is based on CONSTRAINT-BASED SOLVER AUTOMATION

A key challenge of automating product variant selection is debugging mistakes in the product line specification.

Thursday, March 13, 2008

The Sybil Attack

Douceur, J.R. “The Sybil attack” in First International Workshop Peer-to-Peer Systems, IPTPS, 2002 Cambridge, MA, USA, March 7-8, 2002, pp. 251-260.

The goal:
  • To show that Sybil attacks are always possible without the presence of a logically centralized authority.
  • The impracticality of establishing distinct identities in a large-scale distributed system.

Peer-to-Peer systems commonly rely on the existence of multiple independent remote entities to mitigate the threat of hostile peers. There are two methods to do so:
  • Replicating computational or storage tasks among several remote sites to protect against integrity violation
  • Fragmenting tasks among several remote sites to protect against privacy violation
if the local entity has no direct physical knowledge of remote entities, it perceives them only as informational abstractions that we call identities.

The forging of multiple identities is called a Sybil attack.

In the absence of a trusted identification authority (or unrealistic assumptions about the resources available to an attacker), a Sybil attack can severely compromise the initial generation of identities, thereby undermining the chain of vouchers.

faulty entities (deceptive) : The entities capable of performing any arbitrary behavior except as limited by explicit resource constraints

correct entities (honest): entities abiding the rules of any protocol we define

message: an uninterpreted finite-length bit string whose meaning is determined either by an explicit protocol or by an implicit agreement among a set of entities

Each entity e attempts to present an identity i to other entities in the system. A local entity l accepts i if e is able to present identity i to l successfully.

A secure hash of a public key is a straightforward and unforgeable identity. It can also generate a symmetric key for a communication session.
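
A quick illustration of that statement; the key bytes are a placeholder, not a real keypair:

# An identity can simply be a secure hash of the entity's public key, which is
# unforgeable without the matching private key.

import hashlib

public_key = b"-----BEGIN PUBLIC KEY----- ...placeholder bytes... -----END PUBLIC KEY-----"

identity = hashlib.sha256(public_key).hexdigest()
print(identity[:16])   # short, stable identifier derived from the key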


Three sources of information about another entity are:
  • a trusted agency
  • itself
  • other (untrusted) entities (why is it considered untrusted? You can establish trust to some degree, but does that still keep it untrusted?)
Direct validation:
  • Even when severely resource constrained, a faulty entity can counterfeit a constant number of multiple identities.
  • Each correct entity must simultaneously validate all the identities it is presented; otherwise, a faulty entity can counterfeit an unbounded number of identities.