SAGA thread model


Thilo Kielmann

17 Jul 2006

2:16 a.m.

I am sorry to say, but SAGA's task model seems to me severely flawed. This is for two reasons:

1. the "main" thread (executing sync operations) needs to be considered as yet another task
2. there must be a concise definition of the shared state. current solutions are ad-hoc and mostly undefined.

shared state is:
- local objects, shared between multiple tasks of the same process
  here: definition of synchronization between tasks
- remote objects, in the service(s)
  here: definition of legal execution orders

so far, I can see only a few incidental definitions, but they are far from being concise.

"tasks in a bulk operation have to be independent"
"a task cancel is doing 'best effort' but can not guarantee cancelation"

The latter, BTW, is a special case, because this is about connection termination for which you can formally prove that there is no protocol that can guarantee this AND notify both parties of successful termination.

To be constructive: what the task model must do first thing is
- define tasks
- define which data is shared between tasks and which concurrency control happens on this shared data

That is the only way to define clearly what tasks will do in the event of sharing, really.

You may want to look at:

http://www.amazon.com/gp/product/0201695812/qid=1153102128/sr=2-3/ref=pd_bbs_b_2_3/002-2045221-3597631?s=books&v=glance&n=283155

This is: Doug Lea, "Concurrent Programming in Java: Design Principles and Patterns"

This book uses 280 pages on objects, shared state and concurrency control before using 95 pages for the thread operations...

--
Thilo Kielmann http://www.cs.vu.nl/~kielmann/


Thilo Kielmann

17 Jul 2006

5:02 a.m.

New subject: [saga-rg] SAGA thread model

Giving it another thought, I think it isn't as bad as my last mail assumed to be.

The difference is that SAGA tasks aren't threads. They are kind-of single operations (e.g., a single file.read, but no sequence of multiple of such operations). It is just that the obvious implementation in Java would be using threads for everything asynchronous...

Still, if state sharing between multiple tasks (or between a task and the main thread) is desired, we need to start out by defining data consistency of all local and remote objects...

Thilo

--
Thilo Kielmann http://www.cs.vu.nl/~kielmann/

Andre Merzky

2:42 p.m.

New subject: [saga-rg] SAGA thread model

Quoting [Thilo Kielmann] (Jul 17 2006):

> Giving it another thought, I think it isn't as bad as my last mail assumed to be.
>
> The difference is that SAGA tasks aren't threads. They are kind-of single operations (e.g., a single file.read, but no sequence of multiple of such operations). It is just that the obvious implementation in Java would be using threads for everything asynchronous...

Right, that is what SAGA tasks are: they represent an async operation.

> Still, if state sharing between multiple tasks (or between a task and the main thread) is desired, we need to start out by defining data consistency of all local and remote objects...

We already had a discussion about this, which was triggered by similar comments from Felix. The result of the discussion was (cited from the spec intro):

\subsubsection{Consistency Model}

We had a lengthy discussion about consistency models, with the agreement that the consistency model is to be defined and documented by the implementation. The API spec itself does not assume any specific consistency model, as we feel that (a) POSIX consistency is not achievable within reasonable effort/performance, (b) if the user assumes the worst (no consistency), he will still be able to make good use of the API, and (c) reality will be somewhere in the middle.

After discussing further with some OGSA folx at the last GGF, I added:

Implementors SHOULD, however, strive to implement ``At Most Once'' consistency, as that seems (a) to be generally supported by most Grid middleware, (b) implementable in distributed systems with reasonable effort, and (c) useful and intuitively expected by most end users.

There has been some recent discussion on the BES and OGSA lists about At-Least-Once and At-Most-Once, but I am pretty positive that our use cases benefit most from At-Most-Once.

Is that what you are looking for?

It does not really say anything about the shared state of objects, and the life time consequences for these objects (that is what this thread originally tried to discuss).

For that, I tried to clean up the intro once more; see the CVS version of

\subsubsection{Life Time Management}

In the light of what we discussed about consistency and tasks, does that section make sense to you?

Cheers, Andre.

--
"So much time, so little to do..." -- Garfield
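As an illustration of the "At Most Once" recommendation cited in Andre's reply: one common way to get this behaviour is for a service to remember request identifiers and never run the same request a second time. The sketch below is not taken from the SAGA spec or from any Grid middleware; `AtMostOnceService`, `submit`, and the request-id scheme are hypothetical names chosen for the example.

```python
import uuid


class AtMostOnceService:
    """Hypothetical service that executes each request id at most once."""

    def __init__(self):
        self._replies = {}   # request id -> cached reply
        self.executions = 0  # how often an operation body actually ran

    def submit(self, request_id, operation):
        # A retransmitted request gets the cached reply; the operation
        # itself is not executed again.
        if request_id in self._replies:
            return self._replies[request_id]
        self.executions += 1
        reply = operation()
        self._replies[request_id] = reply
        return reply


service = AtMostOnceService()
log = []


def append_line():
    log.append("line")
    return len(log)


rid = str(uuid.uuid4())
first = service.submit(rid, append_line)
retry = service.submit(rid, append_line)  # e.g. the client timed out and resent
```

If a request never reaches the service, the operation runs zero times. That is the difference from At-Least-Once, where the client keeps retrying and the operation may run more than once.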

Hirmer Stephan

4:10 p.m.

New subject: [saga-rg] SAGA thread model

Hi All,

I was just following VU's discussion and would like to comment on it.

As I understand it, tasks are by definition independent from each other, because they are asynchronous operations, right? With this in mind, SAGA users should be responsible for managing possible race conditions themselves. They are the only ones who are aware of the exact nature of the calls they put into tasks. That is why it is easier for them to avoid race conditions in their own code than for the SAGA spec to avoid race conditions in every case without knowing the exact semantics.

Copying around objects which are used by tasks seems to be problematic, as these different copies need to be synchronised afterwards (as Andre pointed out).

Hence, I would propose to leave the problems of race conditions to the end user, because there is probably no general solution which would not contradict certain use cases.

regards, Stephan
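Stephan's point, that the caller is best placed to guard against races in their own tasks, can be sketched as follows. This uses Python's `concurrent.futures` as a stand-in for asynchronous tasks; it is not SAGA code, and `Counter` and `increment` are hypothetical names.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class Counter:
    """Shared local object; the application, not the API, protects it."""

    def __init__(self):
        self.value = 0
        self.lock = threading.Lock()


def increment(counter, times):
    for _ in range(times):
        # The caller knows this read-modify-write must not interleave,
        # so the caller takes the lock around it.
        with counter.lock:
            counter.value += 1


counter = Counter()
with ThreadPoolExecutor(max_workers=4) as pool:
    tasks = [pool.submit(increment, counter, 10_000) for _ in range(4)]
    for task in tasks:
        task.result()  # wait for completion and re-raise any exception

total = counter.value
```

The lock lives in application code because only the application knows which operations on the shared object conflict with each other.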

Andre Merzky

2:41 p.m.

New subject: [saga-rg] SAGA thread model

Hi Thilo,

Quoting [Thilo Kielmann] (Jul 17 2006):

> I am sorry to say, but SAGA's task model seems to me severely flawed. This is for two reasons:
>
> 1. the "main" thread (executing sync operations) needs to be considered as yet another task

In what respect? With respect to object state, that is the case. Did you have something else in mind?

> 2. there must be a concise definition of the shared state. current solutions are ad-hoc and mostly undefined.

Why do you think that? We might need to express that better in the doc, but the current solution is "object state is shared". That is not really ad-hoc, and it's well defined. So, what exactly do you think needs better definition?

> shared state is:
> - local objects, shared between multiple tasks of the same process
>   here: definition of synchronization between tasks

What does synchronization between tasks mean? As tasks are async, they are, by definition, not synchronized.

> - remote objects, in the service(s)
>   here: definition of legal execution orders

We don't make assumptions about 'legal execution order' - where would that order be defined? It is application (i.e. use case) dependent IMHO. Why should that order be reflected in the API? Do you have an example? I might be missing the point here...

> so far, I can see only a few incidental definitions, but they are far from being concise.

Please have a look at the intro (CVS version). I worked on that over the last few days, and tried to clarify these issues. Do they make sense to you?

> "tasks in a bulk operation have to be independent"

Bulks are not mentioned in the spec. I brought that up as an example only. I should have left that out.

> "a task cancel is doing 'best effort' but can not guarantee cancelation"
>
> The latter, BTW, is a special case, because this is about connection termination for which you can formally prove that there is no protocol that can guarantee this AND notify both parties of successful termination.

Exactly. So we agree that delayed deallocation of resources for cancel and destructors makes sense, at least in those cases? That is described in the subsection "Freeing of Resources and Garbage Collection" in the spec's intro - can you check if you agree with that?
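A purely local analogue of "best effort" cancellation can be seen in Python's `concurrent.futures`, where `Future.cancel()` only succeeds while the call has not started yet. This is a sketch of the calling pattern, not of SAGA's cancel semantics and not of the distributed termination argument above; `long_operation` and the two events are hypothetical helpers.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

started = threading.Event()
release = threading.Event()


def long_operation():
    started.set()      # signal that the operation is now running
    release.wait()     # block until the main thread lets it finish
    return "done"


with ThreadPoolExecutor(max_workers=1) as pool:
    running = pool.submit(long_operation)
    started.wait()                        # make sure it really started
    queued = pool.submit(long_operation)  # waits behind the first one

    cancelled_running = running.cancel()  # too late: already executing
    cancelled_queued = queued.cancel()    # still pending: can be dropped

    release.set()
    result = running.result()
```

The caller has to check the return value of `cancel()` and still wait for the operation that could not be stopped, which is why releasing its resources may have to be delayed.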

> To be constructive: what the task model must do first thing is
> - define tasks

The definition in the spec is: "Objects of this class represent asynchronous API calls." A task in SAGA is what RPC has as an async RPC handle. As the SAGA spec is OO, calling it a handle would be strange, but it is in fact not much more than that.
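The "async RPC handle" view of a task can be sketched like this. The `Task` class and its `run`, `wait`, and `result` methods are hypothetical and only loosely modelled on that one-line description; they are not the SAGA API. A thread is used because, as noted earlier in the thread, that is the obvious way to implement something asynchronous.

```python
import threading


class Task:
    """Hypothetical handle for exactly one asynchronous call."""

    def __init__(self, func, *args):
        self._func = func
        self._args = args
        self._result = None
        self._error = None
        self._done = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def run(self):
        self._thread.start()
        return self

    def _run(self):
        try:
            self._result = self._func(*self._args)
        except Exception as exc:  # keep the error for the caller
            self._error = exc
        finally:
            self._done.set()

    def wait(self, timeout=None):
        # Returns True once the call has finished, False on timeout.
        return self._done.wait(timeout)

    def result(self):
        self.wait()
        if self._error is not None:
            raise self._error
        return self._result


def read_bytes(data, count):
    # Stand-in for a single operation such as one file read.
    return data[:count]


task = Task(read_bytes, b"hello world", 5).run()
finished = task.wait(timeout=5)
payload = task.result()
```

The handle wraps one call and nothing else: there is no way to queue a sequence of operations on it, which is what separates it from a general-purpose thread.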

> - define which data is shared between tasks and which concurrency control happens on this shared data

All is shared, no control.
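What "all is shared, no control" means for a caller can be sketched with two asynchronous calls that mutate the same object. Again this is Python's `concurrent.futures` standing in for tasks, with hypothetical names (`Buffer`, `write`), and it relies on CPython's `list.append` being safe to call from several threads.

```python
from concurrent.futures import ThreadPoolExecutor


class Buffer:
    """Plain shared object: no locking, no copy per task."""

    def __init__(self):
        self.chunks = []


def write(buffer, chunk):
    buffer.chunks.append(chunk)  # mutates the caller's object directly


shared = Buffer()
with ThreadPoolExecutor(max_workers=2) as pool:
    a = pool.submit(write, shared, "from task A")
    b = pool.submit(write, shared, "from task B")
    a.result()
    b.result()

# Both writes are visible to the main thread, but their relative
# order is not defined, so compare them order-independently.
seen = sorted(shared.chunks)
```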

> That is the only way to define clearly what tasks will do in the event of sharing, really.
>
> You may want to look at:
>
> http://www.amazon.com/gp/product/0201695812/qid=1153102128/sr=2-3/ref=pd_bbs_b_2_3/002-2045221-3597631?s=books&v=glance&n=283155
>
> This is: Doug Lea, "Concurrent Programming in Java: Design Principles and Patterns"
>
> This book uses 280 pages on objects, shared state and concurrency control before using 95 pages for the thread operations...

I had a look through a number of other specifications a while ago, about exactly these topics. I must say that most left me in the dark.

Some, e.g. the MPI spec, move the responsibility for correct call order to the end user:

"A correct, portable program must invoke collective communications so that deadlock will not occur, whether collective communications are synchronizing or not."

That is, in some sense, what we do as well: the order of async ops is undefined, and it's the end user's responsibility to sync them, if that is of any concern.

The CORBA protocol spec says:

"Overlapping requests - In general, GIOP message ordering constraints are minimal. GIOP is designed to allow overlapping asynchronous requests; it does not dictate the relative ordering of requests or replies."

and is otherwise silent about object state (well, as far as I can say after skimming through the 1152 page monster...)

API specs like Gnome and the GTK+ thingie do the same as SAGA: allow async calls, and specify them, and leave ordering and consistency to the end user. Same for GridRPC by the way.

Frankly, I don't see any other realistic (== usable and implementable) way.

Cheers, Andre.

--
"So much time, so little to do..." -- Garfield
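When the order of two asynchronous operations does matter, the position taken in this reply is that the caller adds the sync point. A minimal sketch, assuming Python's `concurrent.futures` as a stand-in for tasks and a hypothetical in-memory `store` with `put` and `get` helpers:

```python
from concurrent.futures import ThreadPoolExecutor

store = {}


def put(key, value):
    store[key] = value


def get(key):
    return store.get(key)


with ThreadPoolExecutor(max_workers=2) as pool:
    writer = pool.submit(put, "answer", 42)
    writer.result()                  # explicit sync point chosen by the user
    reader = pool.submit(get, "answer")
    value = reader.result()
```

Without the `writer.result()` call the read could run before the write, and the API would not be at fault: the ordering constraint exists only in the application's logic.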
