January 2007 - saga-rg - lists.ogf.org

Java Bindings
by Pascal Kleijer 25 Feb '07

25 Feb '07

As a follow-up of our SAGA sessions at OGF 19, here is one way on how to proceed with the SAGA Java Bindings and in fact with some other language bindings as well. Now this is to open the discussion and I don't want to impose the concept. So all of you that have interest in the Java binding please let your voice be heard. We have seen that each implementation of SAGA going on is not necessary standardized and not complete. The SAGA specification stipulates that partial is possible and each language bindings can introduce its specific flavor. That isn't the best way to get a standard around the corner. Java is a powerful language that allows a high level of abstraction. SAGA should follow a path similar to what has been done with, for example, XML: org.w3c.dom, org.xml.sax and JAXP. That means that SAGA: - Must have a complete abstract API based on façade Interfaces, - Minimize the number of concrete classes to Exceptions, Error and Factories (or Service Provider Interfaces - SPI), - Allow multiple vendor implementations in the back. For example multiple XML parsers can be installed at the same time. - Have a mechanisms to use one or more implementation at the same time, the user can choose one explicitly or not, - If one implementation does not support a specific section of the specification, a safe fall back mechanism should allow another implementation to be used transparently. The general SAGA API should have a common namespace like: org.ogf.saga. Specific implementation can be anything like nl.vu.saga. If the SAGA API façade is available and stabilized it could/should be submitted to the Java Community Process to become an official Java specification for Grid applications through the JSR (Java Specification Request). Pascal Kleijer k-pasukaru(a)ap.jp.nec.com

2 1

OGF 19 SAGA minutes II
by Pascal Kleijer 30 Jan '07

30 Jan '07

Dear all, This is the second, and last, batch of minutes. BR, Pascal Kleijer k-pasukaru(a)ap.jp.nec.com

1 1

OGF 19 SAGA Minutes
by Pascal Kleijer 30 Jan '07

30 Jan '07

Dear all, Here is the first batch of minutes for the SAGA sessions we had at OGF 19. BR, Pascal Kleijer k-pasukaru(a)ap.jp.nec.com SAGA - Public Comments Sunflower: 14:00-15:30 Monday 29th January 2007 Goal: Overview of the public comments made on the API. [AD] Workshop and All Hands-On presentation. Q: Can we have an overview of the API? A: Two persons request it. The presentation is shifted to the next SAGA session. Overview of Kaiser's Comments. Overview of Kleijer's Comments. Discussion on the state machine and "Unknown" state. Language binding might have additional information about how the state is used. Usage of Proxy Gates for streams. The consensus is to let it to the language binding and not within the API. Another mechanism has to be used. Overview of Illingworth's Comments. Overview of Pipan's Comments. Q: Life time of the default session. How do we handle it? A: Get created as needed, gets destroyed on application shutdown. Q: What is the purpose of adding a task to task_container more than once? A: Nothing presents it. Q: What happen when 3 files copy on the same job is done? A: Depends on the implementation and how the job is handled on the back-end. Conclusion: A task can only be added once to a container. The "add" will be silently ignored if added more then once. Q: Discussion on the error handling and error list vs. single error messages. A: No we should stay as is now. Q: Should the automatic conversion of a string to an array be there? A: To simplify the API no, the conversion to list or arrays should not be automatic. Q: Byte ordering important? A: We have only streams of bytes and files of bytes, thus no problem. TODO: Ensures that the API properly state that we have stream of bytes and file of bytes. Problem of having the attribute interface not supporting the asynchronous interface. The case is replica management where attribute are stored on the back-end. Conclusion: The attribute interface will implement the asynchronous interface. (This is ugly but necessary). Q: Should SAGA add a byte buffer class to handle streams, files, messages etc? A: Yes, the API should be changed to support only bytes and add a Byte Buffer class. API final submission for OGF 20. The document will be submitted end of March 2007. What to submit: - Clean New - Commented New - Change Log Session Close: 15:25 % SAGA - Extensions I Azalea: 9:00-10:30 Tuesday 30th January 2007 Goal: C++ Language Bindings & API Extensions Introduction of the C++ language bindings. Uses TR1 (currently Boost). Q: Is the usage of TR1 (Boost) a good idea? A: Yes this is standardized and supported by most compilers. The C/C++ community largely accepts this model. Q: Do we shutdown some user base with this? A: gcc 4 or more compilers should support it. Boost implementation is tested on many platforms everyday to ensure compatibility so TR1 should be fine. TODO: List the possible architecture to see if TR1 is available. Post on the mailing list and give a 2 month deadline. Q: Is the usage of templates good? A: Yes since this is the most clean and simple way to do in C++. TODO: Put in the mailing list to see if this is a good choice. TODO: Create a list of header files for SAGA to define a core L&F for C++ like in Java. Combine this into a language binding document for OGF. TODO: Create an entry in CVS for the C++ binding. Introduction of the Messaging API (Andre). Q: Is the topology approach useful and simple enough to support the 80/20 rule? A: Yes but it is build on top of the streams API and ensure to support the needs of the different use cases. Q: Are the labels for reliability correct? A: They might not be the best, subject to be changed. Q: The flags cannot be changed over time? A: No, this might introduce to complex implementation and undefined states if the connections are already setup. Q: Is the message class "manage" flag good? A: In terms of C++ no, we better have another scheme. Consider to define in the language binding. Q: Is this API not like MPI? A: Yes MPI can be one implementation. Q: Could we not dig into the use cases to set a list of usages and submit it to the MPI folk? A: Yes this is a good idea. It would be good to see how MPI does the messaging and take some of the syntax from it. TODO: Andre will tackle this and see how MPI did it and map it back to the proposal. Other groups in OGF are working on some sort of message communication. Synchronization is necessary and planned. TODO: The package name might not be good enough, seek a new name. TODO: See what is different from MPI and list them. Session Closed: 10:30 % SAGA - Extensions II Azalea: 11:00-12:30 Tuesday 30th January 2007 Goal: API Extensions Steve Fisher, presentation about service discovery API. Work based on GLUE. Q: Is this API useful for SAGA? A: Yes, it can nicely fill in some gaps in the current API. Q: Is SQL really the best choice? SAGA uses REGEX. A: Yes since the information is based on key-value pairs and the queries can be rather complex that REGEX cannot properly express. Q: How do we know where the endpoints are? A: This must be exchanged before hand at some point to enable the communication. Q: Can the API be extended to resource discovery? A: This has been discussed. Basically yes, the resources can be attached to a service. Each resource has a unique identifier attached to a service thus backtracking of the service can be done based on the resource id. TODO; Sit off-line to define the document draft. Add use cases and requirements in one document. Target OGF 20 for deadline. CPR presentation. Quick presentation and history of CPR. Overview of CPR requirements. Presentation of the API. Build on the CORE API, based on job and namespace. Q: How to address the blotting of checkpoint files? A: There are some ideas but no concrete solution to handle this. The policy of checkpoint file life time has to be addressed but might be out of scope of SAGA. Information Service Present the idea of the information service and some usage of the approach. Q: How does the API looks like? A: Show the SAGA namespace API as base. The API would be build on top of the namespace. Q: Should SAGA stick to REGEX or move to SQL? A: No consensus on what is better. Write down some code example of IS usage. Session Closed: 12:30 %

1 0

Fwd (dk@gup.jku.at): Workshop@OGF19 "Visualization on the Grid"
by Andre Merzky 25 Jan '07

25 Jan '07

Sorry if you receive this mail multiple times... ----- Forwarded message from Dieter Kranzlmueller <dk(a)gup.jku.at> ----- Announcement: Workshop@OGF19 "Visualization on the Grid" Chairmen: Thilo Kielmann, Dieter Kranzlmueller, Andre Merzky OGF19, Chapel Hill, NC, USA * Session 1: Thursday, February 1, 2:00 pm - 3:30 pm, Room: Windflower http://www.ogf.org/gf/event_schedule/index.php?id=562 * Session 2: Thursday, February 1, 4:00 pm - 5:30 pm, Room: Windflower http://www.ogf.org/gf/event_schedule/index.php?id=624 Grids are offering many ways to address and solve challening scientific problem, and different functionality is provided for different applications of e-Science. However, most existing grids focus on processing scientific data, leaving the endeavor of analyzing the results and understanding the findings to the user. Improvements to the latter are expected from grid-aware or grid-enabled visualization tools, which provide sophisticated visualization capability within a grid environment. Taking the dynamic and heterogeneous characteristics of the grid into account requires new functionality from visualization tools, such as dynamically adapting the visualization pipeline to the changing environment. This workshop intends to provide an exemplary overview of todays existing approaches to grid visualization, and establishes a forum for discussing perspectives and open issues in this domain. Participants are invited to share and discuss their own experiences or lack thereof with the providers of grid visualization functionality. The goal of the workshop is to identify the next steps on the road to adopting visualization as a means of working on the grid. Detailled Agenda: ----------------- Session 1: Existing Approaches (Chair: Andre Merzky) Thursday, February 1, 2:00 pm - 3:30 pm, Room: Windflower http://www.ogf.org/gf/event_schedule/index.php?id=562 * Luc Renambot (EVL, UIC, Chicago, IL), "High-resolution Collaboration over High-Speed Network"" * Dieter Kranzlmueller (GUP, JKU Linz, Austria), "The GVK/GVid Approach: Visualization on the Grid" * Ray Idaszak (RENCI, UNC Chapel Hill, USA), "The Emerging Grid-based Collaborative Viz Environment at RENCI" * Claudio Silva (SCI Utah, USA), "Supporting Data Exploration through Visualization" Session 2: Perspectives and Open Issues (Chair: Thilo Kielmann) Thursday, February 1, 4:00 pm - 5:30 pm, Room: Windflower http://www.ogf.org/gf/event_schedule/index.php?id=624 * Pascal Kleijer (NEC Corporation, Japan), "Service based Visualization: Concepts and Problems" * Andre Merzky (Vrije Univ. Amsterdam, The Netherlands), "OGF components for Grid Visualization Systems - Status and Outlook" * Panel: "What's next? Issues and goals for the future of visualization on the Grid?" Panelists: Ray Idaszak, Pascal Kleijer, Dieter Kranzlmueller, Andre Merzky, Luc Renambot, Claudio Silva ----- End forwarded message ----- -- "So much time, so little to do..." -- Garfield

1 0

Re: [SAGA-RG] SAGA Message API Extension
by Andre Merzky 24 Jan '07

24 Jan '07

Hi group, Werner and I, and separately Hartmut and I, chatted about the message object, structured and typed data buffers etc. We would like to propose the following approach: - leave the message data buffer unstructured and untyped - allow language bindings to add support for packing native data types into the buffer (similar to MPI and PVM) - use this message buffer for the message API, but possibly also for streams and files (additionally to the char* we support right now) - later discuss more structured message buffers on top of the unstructured ones - even later discuss message buffer with specific data models on top of the structured ones (that may be domain specific, and outside of the SAGA scope per se - more as informational docs or community practice) Lets pick this up at OGF next week from here... Best regards, Andre. Quoting [Werner Benger] (Jan 19 2007): > To: Andre Merzky <andre(a)merzky.net> > Subject: Re: SAGA Message API Extension > From: Werner Benger <benger(a)zib.de> > Cc: Andrei Hutanu <ahutanu(a)cct.lsu.edu>, John Shalf <JShalf(a)lbl.gov>, > SAGA RG <saga-rg(a)ogf.org>, > Gregor von Laszewski <gregor(a)mcs.anl.gov> > > Hi Andre, > > On Thu, 18 Jan 2007 15:47:34 -0600, Andre Merzky <andre(a)merzky.net> wrote: > > Hi Werner, > > > > Quoting [Werner Benger] (Jan 18 2007): > >> > >> Hi Andre, > >> > >> I have two other remarks, which might be orthogonal to the current > >> draft, but might still be good to have it mentioned there: > >> > >> * Structured messages: > >> > >> The current draft just talks about transporting an array of bytes, > >> but in practice we might want to transfer floats/doubles/ints etc. > >> While this *might* be implemented on top of the current msg API, > >> this would be a waste if the low-level protocol implementation > >> (e.g. MPI) already would support such types (including byte ordering > >> conversion). As such, it were useful to have the option to use > >> such mechanisms from a low-level protocol if supported. If not, > >> then it would need to be taken care of on top of the current level. > > > > Ah, good point - that at least needs clarification in the > > spec! > > > > Yes, you are right: the focus on opaque messages is a > > limitation for many use cases. OTOH, support for primitive > > types such such as ints or floats don't by you that much, > > and for more complex structures... - well, who knows better > > than you that agreeing on a data model is a reeaaally > > difficult job? ;-) > > > ah, data model issues are certainly shooting too high here. > however, a self-descriptive typed message structure like it > is possible in MPI or also HDF5 would do the job in a generic > way. It doesn't need to do any more than native C allows, ie. > support native types and arbitrary structures built from them. > That's just the same level as other low-level protocols might > already support. > > > So, basically the message API tries to avoid that topic for > > the main reason that it seems difficult to define. I would > > wholeheartly support any activity which tries to define > > domain or use case specific flavours of the API. That would > > be a simple excercise: you would only need to redefine the > > set_data method on the msg class accordingly. > > > A generic self-descriptive of data should be fully independent > from user cases or application domains. > > > So, the question is: is a very limited support for primitive > > data types something (really) useful? > > > In practice, you would need it anyway. So it's kind of moving > operations that the app needs to do anyway into the common > denominator SAGA. Plus we get the option that these operations > might be done by a lower-level protocol interface which might > be more efficient than if done by the application. > > What I would tend to avoid is to use e.g. a protocol, which knows > about floats and byteordering, just to shuffle bytes, and then > do the byteordering manually again. > > > >> * Interfacing Event Loops: > >> > >> If we want to use this API from within a larger application instead > >> of just self-standing programs, we might want to use mechanisms such > >> as socket callbacks for event handling (eg. the QSocketNotifier or > >> under X11 using XtAppAddInput). Would be good to have some support > >> to allow this, even though it might be optional. > > > > Right, thats an important point, in particular for the > > visualization use cases. Its actually in the spec, but well > > hidden :-) The endpoint class definition says: > > > > class endpoint : implements saga::object > > implements saga::async > > implements saga::monitoring > > [...] > >saga::async is actually an empty interface, but what that > > means is that the class will contain several versions of > > every class method: a synchronous one, and 3 additional > > ones. In C++ the rendering would look like: > > > > // connection setup > > saga::endpoint ep; > > ep.serve (); > > > > // normal, synchronous version > > saga::msg m = ep.recv (); > > > > // task version 1: synchronous > > saga::task t1 = ep.recv <saga::task::Sync> (msg); > > // task version 2: asynchronous > > saga::task t2 = ep.recv <saga::task::ASync> (msg); > > // task version 3: task > > saga::task t3 = ep.recv <saga::task::Task> (msg); > > > > These three versions of the recv method all return a task, > > which only differs in its state: t1 is Done, t2 is Running, > > and t3 is New (not yet running). You can get notification > > on when a task is Done etc. > > > Task means a thread or is this just some saga-internal data > type? > > > > > Additionally, the spec defines some metrics on the endpoint, > > among them: > > > > // Metrics: > > // name: Message > > // desc: fires if a message arrives > > // mode: Read > > // unit: 1 > > // type: String > > // value: "" > > // notes: - the value is the endpoint URL of the > > // sending party, if known. > > > > These metrics are used by the monitoring interface, which is > > also implemented by the endpoint. With that, you can add > > callbacks to an endpoint which gets called when a new msg > > arrives: > > > > saga::endpoint ep; > > ep.add_callback ("Message", my_cb); > > ep.serve (); > > > > my_cb is a user defined class which implements > > saga::callback, and whose cb() method gets then called on > > incoming messages. > > > > > > Sorry if that was somewhat lengthy. Anyway, point is: async > > ops and notification are covered, by means of the SAGA Core > > Look&Feel, which is inherited by this API. > > > Hm, ok. Good. Maybe you can add some concrete examples in the > appendix, such as how would interfering with QT really look like? > > Werner > > > > > > Cheers, Andre. > > > >> > >> Werner > >> > >> > >> On Thu, 18 Jan 2007 14:18:52 -0600, Andre Merzky <andre(a)merzky.net> wrote: > >> > >> > Hi John, Andrei, > >> > > >> > you are right: getting some feedback from the transport > >> > level folx is certainly a good idea. The API draft won't go > >> > into public comment for another month or so (at least), and > >> > then it will stay in public comment for another 2 months or > >> > longer - that should give us enough time to contact them. > >> > > >> > About ordering: the text Andrei cited is in the spec because > >> > ordering is, as of now, not an attribute of the connection > >> > or endpoint - so the spec tries to nail it down. It says > >> > "MUST be ordered, but no global ordering is required" > >> > because I thought that this covers the majority of use > >> > cases. > >> > > >> > I don't think there are use cases which require global > >> > ordering - or at least not enough to justify a requirement > >> > for global ordering. What is your opinion? Also, thats > >> > really difficult to implement in Grids IMHO. > >> > > >> > Use cases which do not require ordering should be happy with > >> > order preserving connections, too. Question now is: does > >> > the benefit of un-ordered implementations (simplier, smaller > >> > footprint) justify an attribute on API level? Or are there > >> > use cases which require non-ordered delivery for other > >> > reasons? > >> > > >> > Cheers, Andre. > >> > > >> > > >> > Quoting [Andrei Hutanu] (Jan 18 2007): > >> >> > >> >> Hi, > >> >> > >> >> >> > >> >> >>2) I see ordering is enforced, could that be an option? > >> >> > > >> >> > > >> >> >I think ordering is *not* enforced, but I do wonder if it should be > >> >> >an option or a channel property (certainly semireliable will likely > >> >> >result in some reording whereas a TCP channel would enforce ordering > >> >> >of the messages for instance). > >> >> > > >> >> >This is a controversial topic in the HPC message passing community > >> >> >(whether msg. ordering is a good or bad-thing to enforce in at the > >> >> >hardware level). > >> >> > > >> >> I was thinking the same (no strong feelings for either option or > >> >> property) but the text tells otherwise : > >> >> In 2.1 introduction : > >> >> In contrast, this message API extension guarantees that message blocks > >> >> of arbitrary size are delivered in order, and intact, without the need > >> >> for additional application level coordination or synchronization. > >> >> and > >> >> > >> >> then in 2.1.7 reliability corectness and ordering > >> >> The order of sent messages MUST be preserved by the implementation. > >> >> Global ordering is, however, not guaranteed to be preserved: > >> >> > >> >> Assume three endpoints A, B and C, all connected to each other. If A > >> >> sends two messages [a1, a2], in this order, it is guaranteed that both B > >> >> and C receive the messages in this order [a1, a2]. If, however, A sends > >> >> a message [a1] and then B sends a message [b1], C may receive the > >> >> messages in either order, [a1, b1] or [b1, a1]. > >> >> > >> >> Andrei -- "So much time, so little to do..." -- Garfield

1 0

SAGA Message API Extension
by Andre Merzky 22 Jan '07

22 Jan '07

Hi Folx , here is the updated draft of the SAGA Message API, in preparation for OGF-19. It is also available in CVS, as usual. We would be happy to get feedback on the list of course, so don't feel oblidged to hold back your comments for OGF-19 :-) Cheers, Andre. -- "So much time, so little to do..." -- Garfield

9 23

Re: [SAGA-RG] SAGA Message API Extension
by John Shalf 19 Jan '07

19 Jan '07

On Jan 19, 2007, at 7:49 AM, Werner Benger wrote: > If we just add a finegranular timestamp in UTC to each msg, would > that help here? It could, but you 1) have to do a system call (expensive context switch) to get the time 2) Worry about timer granularity 3) in the message bus case, there will be time-skew in the hosts attached to the bus, so you'd also have to keep tables of timeskew corrections for each of the hosts involved in the bus 4) We know what the order of the messages are that have arrived, but we only know after-the-fact whether a message has been delayed. So the timestamp might not be sufficient to ensure message ordering. 5) Still have to have some logic to reorder messages on the recv side 6) There is no 6.... at this point it still looks like a tough problem. > Werner > > On Thu, 18 Jan 2007 17:36:15 -0600, John Shalf <JShalf(a)lbl.gov> wrote: > >> >> On Jan 18, 2007, at 12:18 PM, Andre Merzky wrote: >> >>> Hi John, Andrei, >>> >>> you are right: getting some feedback from the transport >>> level folx is certainly a good idea. The API draft won't go >>> into public comment for another month or so (at least), and >>> then it will stay in public comment for another 2 months or >>> longer - that should give us enough time to contact them. >>> >>> About ordering: the text Andrei cited is in the spec because >>> ordering is, as of now, not an attribute of the connection >>> or endpoint - so the spec tries to nail it down. It says >>> "MUST be ordered, but no global ordering is required" >>> because I thought that this covers the majority of use >>> cases. >>> >>> I don't think there are use cases which require global >>> ordering - or at least not enough to justify a requirement >>> for global ordering. What is your opinion? Also, thats >>> really difficult to implement in Grids IMHO. >> >> Well as I mentioned before, global ordering is actually a hot topic >> for debate in folks who are doing the low-level one-sided messaging >> interfaces (GA/ARMCI vs. UPC/GASNet). The issue with enforcing >> global ordering is that it limits opportunities for performance >> optimization and requires a lot more complexity (SW and HW) and >> software overhead at the endpoints to ensure the ordering is >> enforced. However, global ordering makes it much easier to send >> messages that express fences or barriers. As you can imagine, not >> enforcing ordering (particularly for the message bus case) is a *lot* >> easier to implement, but makes the concept of fences and simultaneity >> of events to be more complicted (starts to look like General >> Relativity brain teasers). >> >> If we want to steer clear of this nasty debate, it seems we should be >> able to query the ordering enforcement (or request it if available) >> offered by the underlying protocol. >> >>> Use cases which do not require ordering should be happy with >>> order preserving connections, too. Question now is: does >>> the benefit of un-ordered implementations (simplier, smaller >>> footprint) justify an attribute on API level? Or are there >>> use cases which require non-ordered delivery for other >>> reasons? >>> >>> Cheers, Andre. >>> >>> >>> Quoting [Andrei Hutanu] (Jan 18 2007): >>>> >>>> Hi, >>>> >>>>>> >>>>>> 2) I see ordering is enforced, could that be an option? >>>>> >>>>> >>>>> I think ordering is *not* enforced, but I do wonder if it >>>>> should be >>>>> an option or a channel property (certainly semireliable will >>>>> likely >>>>> result in some reording whereas a TCP channel would enforce >>>>> ordering >>>>> of the messages for instance). >>>>> >>>>> This is a controversial topic in the HPC message passing community >>>>> (whether msg. ordering is a good or bad-thing to enforce in at the >>>>> hardware level). >>>>> >>>> I was thinking the same (no strong feelings for either option or >>>> property) but the text tells otherwise : >>>> In 2.1 introduction : >>>> In contrast, this message API extension guarantees that message >>>> blocks >>>> of arbitrary size are delivered in order, and intact, without the >>>> need >>>> for additional application level coordination or synchronization. >>>> and >>>> >>>> then in 2.1.7 reliability corectness and ordering >>>> The order of sent messages MUST be preserved by the implementation. >>>> Global ordering is, however, not guaranteed to be preserved: >>>> >>>> Assume three endpoints A, B and C, all connected to each other. >>>> If A >>>> sends two messages [a1, a2], in this order, it is guaranteed that >>>> both B >>>> and C receive the messages in this order [a1, a2]. If, however, A >>>> sends >>>> a message [a1] and then B sends a message [b1], C may receive the >>>> messages in either order, [a1, b1] or [b1, a1]. >>>> >>>> Andrei >>> >>> >>> >>> -- >>> "So much time, so little to do..." -- Garfield >>> -- >>> saga-rg mailing list >>> saga-rg(a)ogf.org >>> http://www.ogf.org/mailman/listinfo/saga-rg >> >> > > > > -- > ______________________________________________________________________ > __ > Dr. Werner Benger <benger(a)zib.de, > Werner.Benger(a)aei.mpg.de> > Zuse Institute Berlin ZIB > Takustrasse 7 Tel: +49 30 > 84185-184 > D-14195 Berlin-Dahlem, GERMANY Fax: +49 30 > 84185-107 > Max-Planck-Institut fuer Gravitationsphysik Albert-Einstein- > Institut > Am Muehlenberg 1 Tel: +49 (331) 567-7115 > D-14476 Golm bei Potsdam Fax: +49 (331) 567-7298 > http://www.photon.at/~werner/ > >

1 0

Re: [SAGA-RG] SAGA Message API Extension
by Andre Merzky 18 Jan '07

18 Jan '07

Hi Werner, Quoting [Werner Benger] (Jan 18 2007): > > Hi Andre, > > I have two other remarks, which might be orthogonal to the current > draft, but might still be good to have it mentioned there: > > * Structured messages: > > The current draft just talks about transporting an array of bytes, > but in practice we might want to transfer floats/doubles/ints etc. > While this *might* be implemented on top of the current msg API, > this would be a waste if the low-level protocol implementation > (e.g. MPI) already would support such types (including byte ordering > conversion). As such, it were useful to have the option to use > such mechanisms from a low-level protocol if supported. If not, > then it would need to be taken care of on top of the current level. Ah, good point - that at least needs clarification in the spec! Yes, you are right: the focus on opaque messages is a limitation for many use cases. OTOH, support for primitive types such such as ints or floats don't by you that much, and for more complex structures... - well, who knows better than you that agreeing on a data model is a reeaaally difficult job? ;-) So, basically the message API tries to avoid that topic for the main reason that it seems difficult to define. I would wholeheartly support any activity which tries to define domain or use case specific flavours of the API. That would be a simple excercise: you would only need to redefine the set_data method on the msg class accordingly. So, the question is: is a very limited support for primitive data types something (really) useful? > * Interfacing Event Loops: > > If we want to use this API from within a larger application instead > of just self-standing programs, we might want to use mechanisms such > as socket callbacks for event handling (eg. the QSocketNotifier or > under X11 using XtAppAddInput). Would be good to have some support > to allow this, even though it might be optional. Right, thats an important point, in particular for the visualization use cases. Its actually in the spec, but well hidden :-) The endpoint class definition says: class endpoint : implements saga::object implements saga::async implements saga::monitoring [...] saga::async is actually an empty interface, but what that means is that the class will contain several versions of every class method: a synchronous one, and 3 additional ones. In C++ the rendering would look like: // connection setup saga::endpoint ep; ep.serve (); // normal, synchronous version saga::msg m = ep.recv (); // task version 1: synchronous saga::task t1 = ep.recv <saga::task::Sync> (msg); // task version 2: asynchronous saga::task t2 = ep.recv <saga::task::ASync> (msg); // task version 3: task saga::task t3 = ep.recv <saga::task::Task> (msg); These three versions of the recv method all return a task, which only differs in its state: t1 is Done, t2 is Running, and t3 is New (not yet running). You can get notification on when a task is Done etc. Additionally, the spec defines some metrics on the endpoint, among them: // Metrics: // name: Message // desc: fires if a message arrives // mode: Read // unit: 1 // type: String // value: "" // notes: - the value is the endpoint URL of the // sending party, if known. These metrics are used by the monitoring interface, which is also implemented by the endpoint. With that, you can add callbacks to an endpoint which gets called when a new msg arrives: saga::endpoint ep; ep.add_callback ("Message", my_cb); ep.serve (); my_cb is a user defined class which implements saga::callback, and whose cb() method gets then called on incoming messages. Sorry if that was somewhat lengthy. Anyway, point is: async ops and notification are covered, by means of the SAGA Core Look&Feel, which is inherited by this API. Cheers, Andre. > > Werner > > > On Thu, 18 Jan 2007 14:18:52 -0600, Andre Merzky <andre(a)merzky.net> wrote: > > > Hi John, Andrei, > > > > you are right: getting some feedback from the transport > > level folx is certainly a good idea. The API draft won't go > > into public comment for another month or so (at least), and > > then it will stay in public comment for another 2 months or > > longer - that should give us enough time to contact them. > > > > About ordering: the text Andrei cited is in the spec because > > ordering is, as of now, not an attribute of the connection > > or endpoint - so the spec tries to nail it down. It says > > "MUST be ordered, but no global ordering is required" > > because I thought that this covers the majority of use > > cases. > > > > I don't think there are use cases which require global > > ordering - or at least not enough to justify a requirement > > for global ordering. What is your opinion? Also, thats > > really difficult to implement in Grids IMHO. > > > > Use cases which do not require ordering should be happy with > > order preserving connections, too. Question now is: does > > the benefit of un-ordered implementations (simplier, smaller > > footprint) justify an attribute on API level? Or are there > > use cases which require non-ordered delivery for other > > reasons? > > > > Cheers, Andre. > > > > > > Quoting [Andrei Hutanu] (Jan 18 2007): > >> > >> Hi, > >> > >> >> > >> >>2) I see ordering is enforced, could that be an option? > >> > > >> > > >> >I think ordering is *not* enforced, but I do wonder if it should be > >> >an option or a channel property (certainly semireliable will likely > >> >result in some reording whereas a TCP channel would enforce ordering > >> >of the messages for instance). > >> > > >> >This is a controversial topic in the HPC message passing community > >> >(whether msg. ordering is a good or bad-thing to enforce in at the > >> >hardware level). > >> > > >> I was thinking the same (no strong feelings for either option or > >> property) but the text tells otherwise : > >> In 2.1 introduction : > >> In contrast, this message API extension guarantees that message blocks > >> of arbitrary size are delivered in order, and intact, without the need > >> for additional application level coordination or synchronization. > >> and > >> > >> then in 2.1.7 reliability corectness and ordering > >> The order of sent messages MUST be preserved by the implementation. > >> Global ordering is, however, not guaranteed to be preserved: > >> > >> Assume three endpoints A, B and C, all connected to each other. If A > >> sends two messages [a1, a2], in this order, it is guaranteed that both B > >> and C receive the messages in this order [a1, a2]. If, however, A sends > >> a message [a1] and then B sends a message [b1], C may receive the > >> messages in either order, [a1, b1] or [b1, a1]. > >> > >> Andrei -- "So much time, so little to do..." -- Garfield

1 0

Fwd (replogle@ogf.org): [gf-chairs] OGF19 Schedule is up!
by Andre Merzky 07 Jan '07

07 Jan '07

Hi, our 5 SAGA sessions have been scheduled. The slots are: Monday, January 29 9:00 am - 10:30 am SAGA Security Discussion (Bellflower) Monday, January 29 2:00 pm - 3:30 pm SAGA Core API - public comments (Sunflower) Monday, January 29 4:00 pm - 5:30 pm SAGA Core API - Language Bindings (Sunflower) Tuesday, January 30 9:00 am - 10:30 am SAGA API Extensions: Messaging, Information Management (Azalea) Tuesday, January 30 11:00 am - 12:30 pm SAGA API Extensions: CPR, resource discovery, task dependencies (Azalea) There are a number of potential conflicts, with the GIN session on Monday, and two astro-RG sessions on Monday, and possibly with the GridSphere tutorial on Tuesday (which has the same target group IMHO). Anyway, I think its difficult to obtain slots with less collisions, and we should be able to live with those present. Any thoughts? The first session (security discussion) intents to catch up with the recent developments in the security area, and to start some synchronization of the SAGA API with known Grid security paradigms. Please allow me to point you to the "Workshop "Visualization on the Grid" on Thursday, 2:00 pm - 3:30 pm and 4:00 pm - 5:30 pm in the Windflower room. That workshop was initiated by members of the SAGA group(s), and tries to address (amongst others) API and programming issues for programmers of Grid based visualization systems. Cheers, Andre. ----- Forwarded message from Joel Replogle <replogle(a)ogf.org> ----- > To: gf-chairs(a)ggf.org > From: Joel Replogle <replogle(a)ogf.org> > Subject: [gf-chairs] OGF19 Schedule is up! > > OGF Chairs - The schedule for the OGF19 in Chapel Hill, North > Carolina USA is now up on the web-site. We're looking forward to a > very productive OGF meeting! > > Please see: > http://www.ogf.org/OGF19/schedule > > While we've worked hard to minimize schedule conflicts, it's very > possible that some remain. Please email me with conflicts that you > believe will negatively impact your group's sessions. > > Thanks, > > Joel > > ------------------------------------------------------------------------ > ----------- > Joel Replogle - Manager of Standards, Open Grid Forum > replogle(a)ogf.org http://www.ogf.org -- "So much time, so little to do..." -- Garfield

1 0

Simple Job Management
by Pascal Kleijer 07 Jan '07

07 Jan '07

Dear SAGA members, Based on the API document currently in public comment phase, we have implemented a very simple version of the Job Management API. Basically we stripped the underlying SAGA model and when directly to Job Management API. See the attachment UML graph for details. The reason was that we did not have enough time to implement the full SAGA core to support the API and we focus on the NAREGI Super Scheduler (SS). Consideration and simplifications: - We do not support the suspended state at this time (see below for details). - The wait method is reserved in java for signal synchronization and is not used in the same way; we did not implement a real wait method since we are not interested in synchronization of jobs at the moment. This might be equivalent of the Thread.join() method. - Metrics handling has not been added. This might come with future incarnation of the package. - Job_self is not supported, also we could pretend it is the same as job in java. - We do not include all the methods of the job_service so far in the factory, might come in later incarnations and rename the factory in service. - Checkpoint and migrate are not supported for now. In two models it has no sense and the SS seems not to support it yet. - Signal is not supported, internally some implementations have it. But again the SS does not. - Many attributes are not supported at the moment. - Session and security model ignored (the SS has his own model) others don't care. - A job description can take strings, collections and a string arrays as arguments. Other formats are allowed if the caller knows how to manipulate them. Properties that are known to use string arrays have direct assessors to facilitate their access. All properties can be stored as a single string to allow serialization. Since we are working in Java we decided to go pure pattern oriented. So an application has access to the factory, job pattern and a Job description. The concrete Job stubs implementations are not supposed to be exposed (but are accessible since the class is public). Now the design is made so that we can later on hock the SAGA core classes below the current API without breaking (to much) the code (assuming we hold on our design pattern approach). We have three concrete implementations of Jobs: Local, SSH and Super Scheduler. - The local job uses the java process object and handles a job on the same machine as the JVM. This job type does not support suspended mode at all. This is a fully synchronous job since all actions are taken on the spot, unless you submit the job on a queuing system. - The SSH is a remote job incarnation, the job can run on any machine that has the SSH daemon running, this can be a synchronous job, unless you submit the job on a queuing system. This job type does not support suspended mode for the moment and only POSIX systems can be used to launch the job. - The Super Scheduler is NAREGI specific and uses NAREGI’s middleware. This is an asynchronous job. Suspended mode cannot be directly handled even if the state exists in the SS, so this is still pending. This job produces internally WSDL documents; the necessary methods are private however. General comments and questions. Might be some meat for the public comments as well: Now we stumbled upon the state machine of the API. The "Unknown" and "New" state are unclear to us. In our opinion when you create a job either with the factory or directly with the constructor of the specific incarnation, we enter the "New" state. The "Unknown" state is now reserved for the very short time the object is instantiated but we directly switch to "New" once the constructor is finished. The principle in OO programming is to have a stable object once you finish constructing it and calling method; if the constructor is not enough to have a stable object you need a factory. So when you get an object back it should be in a stable state, thus the "Unknown" state is superficial in our opinion. Some metrics or attributes or the Job are useless since they come directly from the descriptor, Example: "ExecutionHosts", "WorkingDirectory" or "CPUTimeLimit". Unless you consider that these values might be different from the job description. Or if the job description don't mention them the job can have this values assigned by the back-end. Either case the API documentation should clarify this. The run_job from the service will not follow the API contract if implemented. Only one parameter can be returned in java. Also the streams are available thought the Job pattern. In the document section 3.8.8 Examples the example at line 16 and 17 is wrong (or the method is overwritten). There should be no string argument. The host should be set in the descriptor. -- Best regards, Pascal Kleijer ---------------------------------------------------------------- HPC Marketing Promotion Division, NEC Corporation 1-10, Nisshin-cho, Fuchu, Tokyo, 183-8501, Japan. Tel: +81-(0)42/333.6389 Fax: +81-(0)42/333.6382

2 3