Is there any reason why both of these lines:
llmperf/token_benchmark_ray.py
Line 94 in f1d6bed
clients = construct_clients(llm_api=llm_api, num_clients=1)
llmperf/token_benchmark_ray.py
Line 148 in f1d6bed
clients = construct_clients(llm_api=llm_api, num_clients=1)
have num_clients hard-coded to 1? Couldn't they be changed to:
import multiprocessing
num_cores = multiprocessing.cpu_count() # Get total available CPU cores
clients = construct_clients(llm_api=llm_api, num_clients=num_cores)
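As a minimal, self-contained sketch of the change being proposed (the helper name pick_num_clients is my own, not part of llmperf), the client count could be derived from the core count with a safe fallback:

```python
import multiprocessing

def pick_num_clients(fallback: int = 1) -> int:
    # Use all available CPU cores; fall back to `fallback` (the current
    # hard-coded behavior) if the core count cannot be determined.
    try:
        return multiprocessing.cpu_count()
    except NotImplementedError:
        return fallback
```

The result would then be passed as num_clients to construct_clients in place of the literal 1.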
As I've seen in the docs, parallelism in Ray can be achieved in two ways:
clients = [OpenAIChatCompletionsClient.remote() for _ in range(8)] # multiple actors
# OR
@ray.remote(num_cpus=2) # Each actor uses 2 CPUs
class OpenAIChatCompletionsClient(LLMClient):
pass
And in this case I would prefer the first way... Am I missing something, or does the code as written not use all the available CPUs?
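To make the concern concrete, here is a toy, Ray-free sketch (the function and the round-robin policy are my own illustration, not llmperf code): with num_clients=1 every request lands on the same worker, while N clients spread the same requests across N workers.

```python
def assign_requests(num_requests: int, num_clients: int) -> list:
    # Round-robin each request to a client index, mimicking how a pool
    # of Ray actor handles would share the request load.
    return [i % num_clients for i in range(num_requests)]

# With one client, all 8 requests serialize on worker 0.
print(assign_requests(8, 1))  # [0, 0, 0, 0, 0, 0, 0, 0]
# With four clients, the same 8 requests spread across workers 0-3.
print(assign_requests(8, 4))  # [0, 1, 2, 3, 0, 1, 2, 3]
```

Assuming each Ray actor processes its queued calls serially (standard actor semantics), the single-client version would leave the other cores idle during the benchmark.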