Skip to content

No Speedup across bsz: 1 --> 48 #7

@ZepinLi

Description

@ZepinLi

hi team,
Thanks for great contribution to opensource community!
I ran under the instructions of Readme. However, I did not see real speedup across all batch size settings from 1 to 48, while degradation on throughput happened.
Do I have anything wrong?

Results

*hardware: A100 8x40G

Image

Command Lines

for B in 1 16 32 48
do
echo "Running with B = $B" >> $LOG_FILE
ENABLE_INTRA_NODE_COMM=1 torchrun --standalone --nproc_per_node=8 tests/SnapKV/selfspec_benchmark.py
--model $MODEL_PATH/model.pth
--model_name $MODEL_HF
--rank_group 0 1 2 3 4 5 6 7
--gamma 2
--B $B
--prefix_len 32032
--max_len 32288
--draft_budget 257
--benchmark
--compile >> $LOG_FILE 2>&1
done

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions