proposal: daemon process #769

kemingy · 2022-08-12T09:51:12Z

Signed-off-by: Keming [email protected]

cc @aseaday @terrytangyuan

kemingy · 2022-08-12T09:52:30Z

This is related to the following:

aseaday

Where will the stdout log be?

kemingy · 2022-08-12T09:59:02Z

> Where will the stdout log be?

Some ideas:

* manually redirect to files
* use `journalctl` if the process is controlled by systemd
* automatically redirect to a file and provide access through something like `envd log --name <container_name>`
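The automatic-redirect idea can be sketched in a few lines. This is a minimal illustration only, not envd's implementation: `start_daemon`, the `<name>.log` naming scheme, and the log directory are all hypothetical.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def start_daemon(name, command, log_dir):
    """Start `command` in the background, sending its output to <log_dir>/<name>.log.

    Hypothetical helper: the name and the log layout are assumptions.
    """
    log_path = Path(log_dir) / f"{name}.log"
    log_file = open(log_path, "ab")
    proc = subprocess.Popen(
        command,
        stdin=subprocess.DEVNULL,   # daemons get no interactive input
        stdout=log_file,            # fd 1 goes to the per-daemon file
        stderr=subprocess.STDOUT,   # fd 2 is merged into the same file
    )
    log_file.close()  # the child keeps its own copy of the descriptor
    return proc, log_path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as log_dir:
        proc, log_path = start_daemon(
            "hello", [sys.executable, "-c", "print('hello from daemon')"], log_dir
        )
        proc.wait()
        print(log_path.read_text().strip())
```

With one file per daemon, a command such as `envd log --name <container_name>` would only need to read the matching file.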

aseaday · 2022-08-12T10:01:11Z

> Where will the stdout log be?
>
> Some ideas:
>
> * manually redirect to files
> * use `journalctl` if the process is controlled by systemd
> * automatically redirect to a file and provide access through something like `envd log --name <container_name>`

Manually redirecting to files, and recording that in the documentation, LGTM.

gaocegege · 2022-08-12T13:03:22Z

Do we need to store the logs in files? I think we can just print to STDOUT and show them in `envd logs`.

VoVAllen · 2022-08-12T13:10:50Z

@gaocegege Since there are multiple processes running at the same time, it's hard to put everything in the same stdout.

gaocegege · 2022-08-12T13:11:12Z

docs/proposals/20220812-daemon-service.md

+## API
+
+```python
+runtime.daemon(commands=[


Then how do we support services like TensorBoard with the help of this feature?

This needs several features working together:

* run a daemon tensorboard process (this proposal)
* expose the port to the host
* mount the log directory

gaocegege · 2022-08-12T13:12:21Z

> @gaocegege Since there are multiple processes running at the same time, it's hard to put everything in the same stdout.

Currently, the stdout looks like:

time="2022-08-11T09:41:16Z" level=info msg="zsh exists at /usr/bin/zsh"
time="2022-08-11T09:41:16Z" level=info msg="ssh server v0.2.0-alpha.13+1fec011 started in 0.0.0.0:2222"
[I 09:41:17.144 NotebookApp] Writing notebook server cookie secret to /home/envd/.local/share/jupyter/runtime/notebook_cookie_secret
[I 09:41:17.280 NotebookApp] Serving notebooks from local directory: /home/envd/mnist
[I 09:41:17.280 NotebookApp] Jupyter Notebook 6.4.12 is running at:
[I 09:41:17.280 NotebookApp] http://96956beaaa6c:8888/
[I 09:41:17.280 NotebookApp] Use Control-C to stop this server and shut down all kernels (twice to skip confirmation).
[W 09:41:17.282 NotebookApp] No web browser found: could not locate runnable browser.
time="2022-08-11T09:41:17Z" level=info msg="starting ssh session with command 'zsh'" session.id=179e909a-21b0-4031-b901-6b8e81cc8036
time="2022-08-11T09:41:17Z" level=info msg="agent requested" session.id=179e909a-21b0-4031-b901-6b8e81cc8036
time="2022-08-11T09:41:17Z" level=info msg="handling PTY session" session.id=179e909a-21b0-4031-b901-6b8e81cc8036
[I 09:41:34.769 NotebookApp] 302 GET / (172.17.0.1) 0.740000ms
[I 09:41:34.773 NotebookApp] 302 GET /tree? (172.17.0.1) 0.580000ms
[W 09:41:37.119 NotebookApp] 401 POST /login?next=%2Ftree%3F (172.17.0.1) 2.280000ms referer=http://localhost:38571/login?next=%2Ftree%3F
[W 09:41:39.143 NotebookApp] 401 POST /login?next=%2Ftree%3F (172.17.0.1) 1.890000ms referer=http://localhost:38571/login?next=%2Ftree%3F

VoVAllen · 2022-08-12T13:16:23Z

sshd is also a daemon service. A single stdout makes it hard to track the logs. Say a user launches an envd container for a training job with TensorBoard running alongside it: `envd logs` should show only the stdout of `python train.py` instead of combining everything. Storing the logs separately also helps with debugging when there's a problem.

terrytangyuan · 2022-08-12T14:59:14Z

docs/proposals/20220812-daemon-service.md

+
+## Goals
+
+* able to run multiple daemon processes controlled by `tini`


What is the implementation plan? Any architectural considerations that we should discuss here?

Plan: we can add the commands to the `tini` entrypoint:

envd/pkg/lang/ir/compile.go

Lines 166 to 176 in b9f0af8

ep := []string{
	"tini",
	"--",
	"bash",
	"-c",
}

template := `set -e
/var/envd/bin/envd-ssh --authorized-keys %s --port %d --shell %s &
%s
wait -n`
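One way to read the plan is that each daemon command becomes a background job in the bash script that `tini` runs, ahead of the final `wait -n`. The sketch below renders such a script in Python for illustration only. The real code is Go in `compile.go`; `render_entrypoint` and its parameters are hypothetical, and treating the template's last `%s` as the slot for daemon commands is an assumption.

```python
import shlex

# Mirrors the shape of the template above; named fields replace the %s/%d verbs.
TEMPLATE = """set -e
/var/envd/bin/envd-ssh --authorized-keys {keys} --port {port} --shell {shell} &
{daemons}
wait -n"""


def render_entrypoint(keys, port, shell, daemon_commands):
    """Return the container entrypoint as an argv list (hypothetical helper)."""
    # Each daemon command is started as a background job of the same bash shell.
    daemons = "\n".join(f"{cmd} &" for cmd in daemon_commands)
    script = TEMPLATE.format(
        keys=shlex.quote(keys),
        port=port,
        shell=shlex.quote(shell),
        daemons=daemons,
    )
    return ["tini", "--", "bash", "-c", script]


if __name__ == "__main__":
    ep = render_entrypoint("/keys", 2222, "zsh", ["jupyter-lab"])
    print(ep[-1])
```

Because `wait -n` returns as soon as any one background job exits, the script (and with it the container) ends when the first daemon or the SSH server stops.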

I'd like to discuss whether this is a general approach (it needs to work with other features like mount and expose) to solving issues like "feat(lang): Support TensorBoard" (#527), or whether we should do it another way.

aseaday · 2022-08-12T17:40:21Z

> Do we need to store the logs in files? I think we can just print to STDOUT and show them in `envd logs`.

The stdin/stdout/stderr question still comes down to which file descriptors the parent process passes to its daemon subprocesses as fd 0, 1, and 2. We could force them all into one file or split them out.
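The two options (gather or split) differ only in what the parent hands to each child as fd 1 and fd 2. A minimal sketch, assuming a plain Python parent process; `run_gathered`, `run_split`, and the log file names are hypothetical and not part of envd:

```python
import subprocess
import sys
import tempfile
from pathlib import Path

# Stand-in for a daemon: writes one line to stdout and one to stderr.
CHILD = "import sys; print(sys.argv[1]); print(sys.argv[1] + '-err', file=sys.stderr)"


def run_gathered(names, log_path):
    """Every child inherits the same file as fd 1 and fd 2."""
    with open(log_path, "ab") as log:
        procs = [
            subprocess.Popen(
                [sys.executable, "-c", CHILD, name],
                stdin=subprocess.DEVNULL, stdout=log, stderr=log,
            )
            for name in names
        ]
        for proc in procs:
            proc.wait()


def run_split(names, log_dir):
    """Each child gets its own pair of files as fd 1 and fd 2."""
    procs = []
    for name in names:
        out = open(Path(log_dir) / f"{name}.out.log", "ab")
        err = open(Path(log_dir) / f"{name}.err.log", "ab")
        procs.append(subprocess.Popen(
            [sys.executable, "-c", CHILD, name],
            stdin=subprocess.DEVNULL, stdout=out, stderr=err,
        ))
        out.close()  # the child holds its own copies of the descriptors
        err.close()
    for proc in procs:
        proc.wait()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as log_dir:
        run_gathered(["sshd", "tensorboard"], Path(log_dir) / "all.log")
        run_split(["sshd", "tensorboard"], log_dir)
        print(sorted(p.name for p in Path(log_dir).iterdir()))
```

In the gathered case the order of lines from different children is not guaranteed, which is the tracking problem raised earlier in the thread.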

kemingy · 2022-08-15T08:53:19Z

Here is an example to demonstrate how to use it for jupyter-lab:

def jupyter_lab():
    expose(local_port=8888, host_port=8888, svc="jupyter")
    runtime.daemon(commands=["jupyter-lab"])


def build():
    base(os="ubuntu20.04", language="python")
    install.pip_packages(["numpy", "jupyterlab"])
    jupyter_lab()

cc @Xiaoaier-Z-L

gaocegege · 2022-08-15T09:10:46Z

> > Do we need to store the logs in files? I think we can just print to STDOUT and show them in `envd logs`.
>
> The stdin/stdout/stderr question still comes down to which file descriptors the parent process passes to its daemon subprocesses as fd 0, 1, and 2. We could force them all into one file or split them out.

SGTM

gaocegege

LGTM.

/cc @terrytangyuan @VoVAllen @aseaday @Xiaoaier-Z-L

aseaday

LGTM

VoVAllen

LGTM. Would it be better to put `expose` under the `runtime` namespace as well?

kemingy · 2022-08-15T10:12:28Z

> LGTM. Would it be better to put `expose` under the `runtime` namespace as well?

Agreed. BTW, `expose` is not implemented yet.

gaocegege · 2022-08-15T10:28:58Z

I am merging this to move forward. But feel free to comment if there is any problem.

proposal: daemon process

1867514

Signed-off-by: Keming <[email protected]>

aseaday reviewed Aug 12, 2022

kemingy added type/enhancement 💭 type/feature 💡 type/discussion 🧵 labels Aug 12, 2022

gaocegege reviewed Aug 12, 2022

terrytangyuan reviewed Aug 12, 2022

kemingy mentioned this pull request Aug 15, 2022

could we have any plan to support jupyter lab? #776

Closed

gaocegege approved these changes Aug 15, 2022

aseaday approved these changes Aug 15, 2022

VoVAllen approved these changes Aug 15, 2022

gaocegege merged commit 2f82fa5 into tensorchord:main Aug 15, 2022

kemingy mentioned this pull request Aug 15, 2022

feat(lang): add daemon function to run daemon process in the container #777

Merged

kemingy deleted the proposal_daemon branch August 16, 2022 14:43

kemingy mentioned this pull request Aug 16, 2022

feat(lang): implement expose func #780

Merged


5 participants
