File tree Expand file tree Collapse file tree 1 file changed +3
-4
lines changed Expand file tree Collapse file tree 1 file changed +3
-4
lines changed Original file line number Diff line number Diff line change @@ -24,9 +24,8 @@ class BlockSpaceManagerV2(BlockSpaceManager):
24
24
autoregressively-generated tokens, and other advanced features such as
25
25
prefix caching, forking/copy-on-write, and sliding-window memory allocation.
26
26
27
- The current implementation is partial; in particular prefix caching and
28
- sliding-window are not feature complete. This class implements the design
29
- described in https://github.com/vllm-project/vllm/pull/3492.
27
+ This class implements the design described in
28
+ https://github.com/vllm-project/vllm/pull/3492.
30
29
31
30
Lookahead slots
32
31
The block manager has the notion of a "lookahead slot". These are slots
@@ -190,7 +189,7 @@ def allocate(self, seq_group: SequenceGroup) -> None:
190
189
191
190
assert (request_id
192
191
not in self .cross_block_tables ), \
193
- "block table already exists"
192
+ "block table already exists"
194
193
195
194
check_no_caching_or_swa_for_blockmgr_encdec (self , seq_group )
196
195
You can’t perform that action at this time.
0 commit comments