Containers/Mini-summit 2008
Revision as of 09:00, 18 July 2008
There will be a containers mini-summit at the OLS'08. This page is for organizing this mini-summit. Feel free to edit.
When: 22nd of July 2008
Where: Ottawa, ON, Canada.
Proposal
The mini-summit proposal was sent to the OLS organizers. See proposal.
Topics to discuss
- Device accessibility cgroup (maybe with remap ability)
- TTYs
- Syslog
- Checkpoint/restart
- Memory controllers
- more?..
List of attendees
Please fill in your name here if you are going to attend, or email kir at openvz dot org if you are too lazy. Surely the list is not final, so put your name even if you are not sure you can make it.
This list is in no particular order.
- Pavel Emelyanov
- Denis Lunev
- Andrey Mirkin
- Serge Hallyn
- Dave Hansen
- Daniel Lezcano
- Srivatsa Vaddagiri
- Balbir Singh
- Sukadev Bhattiprolu
- Paul Menage
- Eric W. Biederman
- Oren Laadan
- Yamamoto Takashi
- Kamezawa Hiroyuki
- Benjamin Thery
- Herbert Pötzl
- Oleg Nesterov
- Dhaval Giani
- Bart Trojanowski
- Joseph Ruscio
- Constant Chan
- Linda Knippers
- Satoshi Uchida
- Masahiko Takahashi
- Martine Silbermann
- Benoit des Ligneris
- Patrick Naubert
- Daisuke Nishimura
- Sudhir Kumar
- Munehiro Ikeda
- Kamalesh Babulal
- John Schulz
- Poornima Nayak
Agenda
- Namespaces/Containers (8:30am-11am)
  - sysfs issues (and any /proc issues)
    - uevents/hotplug
  - Network namespaces issues
    - multiple namespaces in one process
  - Device namespace design?
  - User namespace
  - Additional needed namespaces
    - Small namespaces: what to do with small subsystems that might need virtualization? E.g. in OpenVZ we have FUSE, binfmt_misc and some other small stuff virtualized. But how do we merge it into mainline? Create a separate namespace for each? Merge them into one? What should it be called then?
  - Handling filesystem/namespace synchronization (not sure what the issue is)
  - Container design
    - How to enter a container
    - Nature of a 'container': kernel object or userspace fiction
- Cgroups+Resource management (11:30am-2pm)
  - Cgroup implementation
  - Additional cgroups and their design
  - libcg - userspace exploitation of control groups/resource management
  - Resource management
- Checkpoint/Restart (2:30pm-5pm)
  - Documentation
    - Zap : http://www.ncl.cs.columbia.edu/publications/usenix2007_fordist.pdf
    - Metacluster : http://lxc.sourceforge.net/doc/ols2006/lxc-ols2006.pdf
    - OpenVZ : http://wiki.openvz.org/Checkpointing_and_live_migration
    - Checkpoint/Restart technology : http://en.wikipedia.org/wiki/Application_checkpointing
    - Virtual Servers and Checkpoint/Restart in Mainstream Linux : Sigops document
    - Remote fork: http://www.cse.nd.edu/~dthain/courses/classconf/wowsys2004/talks/rfork.pd
    - Vmadump : http://bproc.sourceforge.net/c268.html
    - Posix CR : http://techpubs.sgi.com/library/tpl/cgi-bin/getdoc.cgi/0650/bks/SGI_Admin/CPR_OG/sgi_html/ch03.html
    - An OS services overview : http://sw-eng.falls-church.va.us/itsg/P08V31.htm
  - Goals and expectations of this summit
    - identify, discuss and (if possible) agree on the general design
    - identify, discuss and (if possible) agree on the technical points
    - decide on priorities for different components (e.g. high, medium, low) so that the final outcome is a practical road-map that keeps us busy at least until the next OLS (though the "O" may change ;)
  - What problems can the Linux community solve with checkpoint/restart?
  - Preparing the kernel internals
    - How do we implement it without affecting long-term maintainability?
    - What are the kernel subsystems, process resources and framework for CR?
    - Which pieces to target first?
The following technical points can be discussed during the mini-summit if we have time or later at the OLS.
- Checkpointing / Restarting
  - Reaching a quiescent point - network, processes, aio, avoiding side effects of quiesce/revive
  - Checkpoint - signal handler? syscall? crfs? process hierarchy, resource dependencies, system and process resources
  - Restarting - new binary format handler? converting between formats (from older kernel to newer)
  - Notification to processes which explicitly wish to be notified about quiesce, checkpoint and restart - container state? new signals?
- Determining the userspace API - POSIX 1003.1m?
- Passing the kernel internal state to/from userspace - coredump-like file? swap per container? netlink, CR filesystem? an army of different calls for CR (proc, existing syscalls, ...)
- Hopefully we can continue the discussion over the following days and get a bit of a hackfest going during OLS :)
Moderators
- Namespaces/containers: Serge Hallyn, Dave Hansen
- Cgroups and resource management: Paul Menage, Balbir Singh, Dhaval Giani
- Checkpoint/restart: Daniel Lezcano, Oren Laadan
