Commit 8d93486

v0.1.6 (#54)
* Minor fix (#46)
* leftover minor
* minor
* fix dockerfile for x86_64 to include hopper
* fix benchmark script
* minor
* Models/gemma 3 (#47)
* wip: revamp model registration
* fix gemma3 for causal LM
* gemma3
* update dockerfile
* Bump transformers from 4.46.3 to 4.48.0 in /meta (#49)

Bumps [transformers](https://github.com/huggingface/transformers) from 4.46.3 to 4.48.0.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](huggingface/transformers@v4.46.3...v4.48.0)

---
updated-dependencies:
- dependency-name: transformers
  dependency-version: 4.48.0
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Xiaozhe Yao <[email protected]>

* Support #44
* redo docs (#52)
* WIP: Offloading and other utilities (#51)
* wip: offload support
* debug
* fix sampling
* fix aarch64 dockerfile
* update aarch script
* remove sgl-kernel from dependencies
* update dockerfile
* minor reorg
* remove templated message printout
* fix dockerfile in aarch build
* relax triton req
* move `triton` dep to requirements-cuda
* minor fix
* fix build issues
* test dependencies
* move torch-memory-saver
* minor
* fix build issues & add initial metrics ui
* ready to build
* wip: multistage build
* Models/qwen3 (#53)
* init: qwen3
* qwen3
* minor
* minor refactor: health check on server starts
* minor update
* update buildfile
* minor
* minor bug fix
* logger
* update dev

---------

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
1 parent 8e610b7 commit 8d93486

132 files changed: +18752 -3104 lines changed


.dockerignore

Lines changed: 20 additions & 0 deletions
@@ -2,3 +2,23 @@ docs/
 scripts
 *.egg-info
 .local
+.github/
+.vscode/
+__pycache__/
+*.pyc
+*.pyo
+*.pyd
+*.db
+*.sqlite
+*.log
+*.egg
+*.egg-info
+*.whl
+*.tar.gz
+*.zip
+*.tar
+*.gz
+*.bz2
+.buildcache
+.pytest_cache
+.git

.gitignore

Lines changed: 1 addition & 0 deletions
@@ -8,3 +8,4 @@ pyrightconfig.json
 .data
 *.ipynb
 .coverage*
+.buildcache/

README.md

Lines changed: 6 additions & 1 deletion
@@ -1,3 +1,8 @@
 # Scratchpad
 
-This is an experimental LLM serving system, forked and built on top of [SGLang SRT](https://github.com/sgl-project/sglang/tree/main/python/sglang/srt), and is used to support [SwissAI Model Serving](https://fmapi.swissai.cscs.ch/).
+This is an experimental LLM serving system, forked and built on top of [SGLang SRT](https://github.com/sgl-project/sglang/tree/main/python/sglang/srt), and is used to support [SwissAI Model Serving](https://serving.swissai.cscs.ch/).
+
+## Adoption
+
+- [SwissAI](https://serving.swissai.cscs.ch/) - as the primary serving engine for LLMs.
+- [HexGen-Flow](https://github.com/Relaxed-System-Lab/Hexgen-Flow) - as a LLM execution simulator for Text-to-SQL application.

docker/Dockerfile.aarch64-cuda

Lines changed: 0 additions & 39 deletions
This file was deleted.

docker/build_image.sh

Lines changed: 0 additions & 17 deletions
This file was deleted.

docs/.gitignore

Lines changed: 21 additions & 1 deletion
@@ -1 +1,21 @@
-build/
+# build output
+dist/
+# generated types
+.astro/
+
+# dependencies
+node_modules/
+
+# logs
+npm-debug.log*
+yarn-debug.log*
+yarn-error.log*
+pnpm-debug.log*
+
+
+# environment variables
+.env
+.env.production
+
+# macOS-specific files
+.DS_Store

docs/README.md

Lines changed: 54 additions & 0 deletions
@@ -0,0 +1,54 @@
+# Starlight Starter Kit: Basics
+
+[![Built with Starlight](https://astro.badg.es/v2/built-with-starlight/tiny.svg)](https://starlight.astro.build)
+
+```
+npm create astro@latest -- --template starlight
+```
+
+[![Open in StackBlitz](https://developer.stackblitz.com/img/open_in_stackblitz.svg)](https://stackblitz.com/github/withastro/starlight/tree/main/examples/basics)
+[![Open with CodeSandbox](https://assets.codesandbox.io/github/button-edit-lime.svg)](https://codesandbox.io/p/sandbox/github/withastro/starlight/tree/main/examples/basics)
+[![Deploy to Netlify](https://www.netlify.com/img/deploy/button.svg)](https://app.netlify.com/start/deploy?repository=https://github.com/withastro/starlight&create_from_path=examples/basics)
+[![Deploy with Vercel](https://vercel.com/button)](https://vercel.com/new/clone?repository-url=https%3A%2F%2Fgithub.com%2Fwithastro%2Fstarlight%2Ftree%2Fmain%2Fexamples%2Fbasics&project-name=my-starlight-docs&repository-name=my-starlight-docs)
+
+> 🧑‍🚀 **Seasoned astronaut?** Delete this file. Have fun!
+
+## 🚀 Project Structure
+
+Inside of your Astro + Starlight project, you'll see the following folders and files:
+
+```
+.
+├── public/
+├── src/
+│   ├── assets/
+│   ├── content/
+│   │   ├── docs/
+│   └── content.config.ts
+├── astro.config.mjs
+├── package.json
+└── tsconfig.json
+```
+
+Starlight looks for `.md` or `.mdx` files in the `src/content/docs/` directory. Each file is exposed as a route based on its file name.
+
+Images can be added to `src/assets/` and embedded in Markdown with a relative link.
+
+Static assets, like favicons, can be placed in the `public/` directory.
+
+## 🧞 Commands
+
+All commands are run from the root of the project, from a terminal:
+
+| Command | Action |
+| :------------------------ | :----------------------------------------------- |
+| `npm install` | Installs dependencies |
+| `npm run dev` | Starts local dev server at `localhost:4321` |
+| `npm run build` | Build your production site to `./dist/` |
+| `npm run preview` | Preview your build locally, before deploying |
+| `npm run astro ...` | Run CLI commands like `astro add`, `astro check` |
+| `npm run astro -- --help` | Get help using the Astro CLI |
+
+## 👀 Want to learn more?
+
+Check out [Starlight’s docs](https://starlight.astro.build/), read [the Astro documentation](https://docs.astro.build), or jump into the [Astro Discord server](https://astro.build/chat).
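
Note on docs/README.md: the README states that each `.md` or `.mdx` file under `src/content/docs/` is exposed as a route based on its file name. The sketch below illustrates that mapping in a simplified form. `routeForDocFile` and `DOCS_ROOT` are hypothetical names, not Starlight or Astro APIs, and the trailing-slash and `index` handling are assumptions about the default behavior rather than something this commit confirms.

```javascript
// Hypothetical helper, not part of Starlight or Astro: a simplified
// illustration of file-name-based routing for src/content/docs/.
const DOCS_ROOT = 'src/content/docs/';

function routeForDocFile(filePath) {
  if (!filePath.startsWith(DOCS_ROOT)) {
    throw new Error(`not a docs file: ${filePath}`);
  }
  const relative = filePath.slice(DOCS_ROOT.length);
  // Only .md and .mdx files are treated as pages.
  const match = relative.match(/^(.*)\.(md|mdx)$/);
  if (!match) {
    throw new Error(`not a .md or .mdx file: ${filePath}`);
  }
  // Assumption: an index file maps to the route of its directory.
  const slug = match[1].replace(/(^|\/)index$/, '');
  return slug === '' ? '/' : `/${slug}/`;
}

console.log(routeForDocFile('src/content/docs/guides/example.md')); // /guides/example/
console.log(routeForDocFile('src/content/docs/index.mdx')); // /
```

Real slug generation may also normalize case and special characters, which this sketch deliberately leaves out.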

docs/astro.config.mjs

Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+// @ts-check
+import { defineConfig } from 'astro/config';
+import starlight from '@astrojs/starlight';
+import starlightThemeFlexoki from 'starlight-theme-flexoki'
+
+// https://astro.build/config
+export default defineConfig({
+  integrations: [
+    starlight({
+      title: 'Scratchpad',
+      plugins: [starlightThemeFlexoki()],
+      social: [{ icon: 'github', label: 'GitHub', href: 'https://github.com/eth-easl/Scratchpad' }],
+      sidebar: [
+        {
+          label: 'Guides',
+          items: [
+            // Each item here is one entry in the navigation menu.
+            { label: 'Example Guide', slug: 'guides/example' },
+          ],
+        },
+        {
+          label: 'Reference',
+          autogenerate: { directory: 'reference' },
+        },
+      ],
+    }),
+  ],
+});
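
Note on docs/astro.config.mjs: the sidebar mixes an explicit entry (`slug: 'guides/example'`) with an autogenerated `reference` group. An explicit slug presumably needs a matching page under `src/content/docs/`, and this page does not show whether that file is part of the commit. A minimal sketch of how the explicit slugs could be listed for such a check follows. `collectSidebarSlugs` is a hypothetical helper, not part of the Starlight API, and the `sidebar` array is copied from the config above.

```javascript
// Hypothetical helper for illustration; not part of the Starlight API.
// Walks a sidebar array and returns every explicitly listed slug.
function collectSidebarSlugs(entries) {
  const slugs = [];
  for (const entry of entries) {
    if (typeof entry.slug === 'string') {
      slugs.push(entry.slug);
    }
    // Groups nest their entries under `items`; recurse into them.
    if (Array.isArray(entry.items)) {
      slugs.push(...collectSidebarSlugs(entry.items));
    }
  }
  return slugs;
}

// Sidebar copied from docs/astro.config.mjs in this commit.
const sidebar = [
  { label: 'Guides', items: [{ label: 'Example Guide', slug: 'guides/example' }] },
  { label: 'Reference', autogenerate: { directory: 'reference' } },
];

// Print the candidate source file for each explicit slug.
for (const slug of collectSidebarSlugs(sidebar)) {
  console.log(`${slug} -> src/content/docs/${slug}.md or .mdx`);
}
```

Autogenerated groups are skipped on purpose, since their entries come from the directory contents rather than from the config.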

docs/examples/batch.py

Lines changed: 0 additions & 19 deletions
This file was deleted.

docs/examples/mllama_request.py

Lines changed: 0 additions & 52 deletions
This file was deleted.
