NVIDIA
diff --git a/‎benchmarks/LICENSE‎
Lines changed: 1 addition & 1 deletion b/‎benchmarks/LICENSE‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎benchmarks/README.md‎
Lines changed: 12 additions & 12 deletions b/‎benchmarks/README.md‎
Lines changed: 12 additions & 12 deletions
diff --git a/‎benchmarks/cuquantum_benchmarks/__init__.py‎
Lines changed: 0 additions & 5 deletions b/‎benchmarks/cuquantum_benchmarks/__init__.py‎
Lines changed: 0 additions & 5 deletions
diff --git a/‎benchmarks/cuquantum_benchmarks/backends/backend_cirq.py‎
Lines changed: 0 additions & 52 deletions b/‎benchmarks/cuquantum_benchmarks/backends/backend_cirq.py‎
Lines changed: 0 additions & 52 deletions
diff --git a/‎benchmarks/nv_quantum_benchmarks/__init__.py‎
Lines changed: 5 additions & 0 deletions b/‎benchmarks/nv_quantum_benchmarks/__init__.py‎
Lines changed: 5 additions & 0 deletions
diff --git a/‎benchmarks/cuquantum_benchmarks/__main__.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/__main__.py‎ b/‎benchmarks/cuquantum_benchmarks/__main__.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/__main__.py‎
diff --git a/‎benchmarks/cuquantum_benchmarks/_utils.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/_utils.py‎
Lines changed: 4 additions & 4 deletions b/‎benchmarks/cuquantum_benchmarks/_utils.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/_utils.py‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎benchmarks/cuquantum_benchmarks/backends/__init__.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/backends/__init__.py‎
Lines changed: 5 additions & 1 deletion b/‎benchmarks/cuquantum_benchmarks/backends/__init__.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/backends/__init__.py‎
Lines changed: 5 additions & 1 deletion
diff --git a/‎benchmarks/cuquantum_benchmarks/backends/backend.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/backends/backend.py‎ b/‎benchmarks/cuquantum_benchmarks/backends/backend.py‎ renamed to ‎benchmarks/nv_quantum_benchmarks/backends/backend.py‎
diff --git a/‎benchmarks/nv_quantum_benchmarks/backends/backend_cirq.py‎
Lines changed: 75 additions & 0 deletions b/‎benchmarks/nv_quantum_benchmarks/backends/backend_cirq.py‎
Lines changed: 75 additions & 0 deletions
@@ -1,4 +1,4 @@
-Copyright (c) 2021-2024 NVIDIA CORPORATION & AFFILIATES.
+Copyright (c) 2021-2025 NVIDIA CORPORATION & AFFILIATES.
 
 BSD-3-Clause
 
 
@@ -1,4 +1,4 @@
-# cuquantum-benchmarks
+# nv-quantum-benchmarks
 
 ## Installing
 
@@ -24,12 +24,12 @@ and `pip` would not install any extra package for you.
 
 ## Running
 
-After installation, a new command `cuquantum-benchmarks` is installed to your Python environment. You can see the help message via `cuquantum-benchmarks --help`:
+After installation, a new command `nv-quantum-benchmarks` is installed to your Python environment. You can see the help message via `nv-quantum-benchmarks --help`:
 
 ```
-usage: cuquantum-benchmarks [-h] {circuit,api} ...
+usage: nv-quantum-benchmarks [-h] {circuit,api} ...
 
-=============== NVIDIA cuQuantum Performance Benchmark Suite ===============
+=============== NVIDIA Quantum Performance Benchmark Suite ===============
 
 positional arguments:
   {circuit,api}
@@ -40,23 +40,23 @@ optional arguments:
   -h, --help     show this help message and exit
 ```
 
-Starting v0.2.0, we offer subcommands for performing benchmarks at different levels, as shown above. For details, please refer to the help message of each subcommand, ex: `cuquantum-benchmarks circuit --help`.
+Starting v0.2.0, we offer subcommands for performing benchmarks at different levels, as shown above. For details, please refer to the help message of each subcommand, ex: `nv-quantum-benchmarks circuit --help`.
 
-Alternatively, you can launch the benchmark program via `python -m cuquantum_benchmarks`. This is equivalent to the standalone command, and is useful when, say, `pip` installs this package to the user site-package (so that the `cuquantum-benchmarks` command may not be available without modifying `$PATH`).
+Alternatively, you can launch the benchmark program via `python -m nv_quantum_benchmarks`. This is equivalent to the standalone command, and is useful when, say, `pip` installs this package to the user site-package (so that the `nv-quantum-benchmarks` command may not be available without modifying `$PATH`).
 
 For GPU backends, it is preferred that `--ngpus N` is explicitly set. On a multi-GPU system, the first `N` GPUs would be used. To limit which GPUs can be accessed by the CUDA runtime, use the environment variable `CUDA_VISIBLE_DEVICES` following the CUDA documentation.
 
-For backends that support MPI parallelism, it is assumed that `MPI_COMM_WORLD` is the communicator, and that `mpi4py` is installed. You can run the benchmarks as you would normally do to launch MPI processes: `mpiexec -n N cuquantum-benchmarks ...`. It is preferred if you fully specify the problem (explicitly set `--benchmark` & `--nqubits`).
+For backends that support MPI parallelism, it is assumed that `MPI_COMM_WORLD` is the communicator, and that `mpi4py` is installed. You can run the benchmarks as you would normally do to launch MPI processes: `mpiexec -n N nv-quantum-benchmarks ...`. It is preferred if you fully specify the problem (explicitly set `--benchmark` & `--nqubits`).
 
 Examples:
-- `cuquantum-benchmarks api --benchmark apply_matrix --targets 4,5 --controls 2,3 --nqubits 16`: Apply a random gate matrix controlled by qubits 2 & 3 to qubits 4 & 5 of a 16-qubit statevector using cuStateVec's `apply_matrix()` API
-- `cuquantum-benchmarks circuit --frontend qiskit --backend cutn --compute-mode statevector --benchmark qft --nqubits 8 --ngpus 1`: Construct a 8-qubit QFT circuit in Qiskit and compute the statevector with cuTensorNet on GPU. Note that the `--compute-mode` can be specified only for `cutn` backend and supports `amplitude` (default), `statevector`, and `expectation`. 
-- `cuquantum-benchmarks circuit --frontend cirq --backend qsim-mgpu --benchmark qaoa --nqubits 16 --ngpus 2`: Construct a 16-qubit QAOA circuit in Cirq and run it with the (multi-GPU) `qsim-mgpu` backend on 2 GPUs (requires cuQuantum Appliance)
-- `mpiexec -n 4 cuquantum-benchmarks circuit --frontend qiskit --backend cusvaer --benchmark quantum_volume --nqubits 32 --ngpus 1 --cusvaer-global-index-bits 1,1 --cusvaer-p2p-device-bits 1`: Construct a 32-qubit Quantum Volume circuit in Qiskit and run it with the (multi-GPU-multi-node) `cusvaer` backend on 2 nodes. Each node runs 2 MPI processes, each of which controls 1 GPU (requires cuQuantum Appliance)
+- `nv-quantum-benchmarks api --benchmark apply_matrix --targets 4,5 --controls 2,3 --nqubits 16`: Apply a random gate matrix controlled by qubits 2 & 3 to qubits 4 & 5 of a 16-qubit statevector using cuStateVec's `apply_matrix()` API.
+- `nv-quantum-benchmarks circuit --frontend qiskit --backend cutn --compute-mode statevector --benchmark qft --nqubits 8 --ngpus 1`: Construct a 8-qubit QFT circuit using Qiskit and compute the statevector with cuTensorNet on GPU. The `--compute-mode` option determines the type of computation performed, and can be set to `amplitude`, `statevector`, `sampling`, or `expectation`, depending on the backend used. However, not all backends support all compute modes, and each backend has its own default mode.<br>  When the `compute-mode` is set to `expectation`, it is allowed to specify the following options to define the Pauli operators for which the expectation value is computed: `--pauli-string`, `--pauli-seed`, or `--pauli-identity-fraction`.
+- `mpirun -np 2 nv-quantum-benchmarks circuit --frontend cudaq --backend cudaq-mgpu --compute-mode expectation --benchmark qaoa --nqubits 16 --ngpus 2`: Construct a 16-qubit QAOA circuit in NVIDIA CUDA-Q and compute the expectation with the (multi-GPU) `cudaq-mgpu` backend on 2 GPUs.
+- `mpiexec -n 4 nv-quantum-benchmarks circuit --frontend qiskit --backend cusvaer --benchmark quantum_volume --nqubits 32 --ngpus 1 --cusvaer-global-index-bits 1,1 --cusvaer-p2p-device-bits 1`: Construct a 32-qubit Quantum Volume circuit in Qiskit and run it with the (multi-GPU-multi-node) `cusvaer` backend on 2 nodes. Each node runs 2 MPI processes, each of which controls 1 GPU (requires cuQuantum Appliance).
 
 ## Known issues
 
-- Due to Qiskit Aer's design, it'd initialize the CUDA contexts for all GPUs installed on the system at import time. While we can defer the import, it might have an impact to the (multi-GPU) system performance when any `aer*` backend is in use. For the time being, we recommend to work around it by limiting the visible devices. For example, `CUDA_VISIBLE_DEVICES=0,1 cuquantum-benchmarks ...` would only use GPU 0 & 1.
+- Due to Qiskit Aer's design, it'd initialize the CUDA contexts for all GPUs installed on the system at import time. While we can defer the import, it might have an impact to the (multi-GPU) system performance when any `aer*` backend is in use. For the time being, we recommend to work around it by limiting the visible devices. For example, `CUDA_VISIBLE_DEVICES=0,1 nv-quantum-benchmarks ...` would only use GPU 0 & 1.
 
 ## Output data
 
 
@@ -0,0 +1,5 @@
+# Copyright (c) 2021-2025, NVIDIA CORPORATION & AFFILIATES
+#
+# SPDX-License-Identifier: BSD-3-Clause
+
+__version__ = '0.5.0'
@@ -1,4 +1,4 @@
-# Copyright (c) 2021-2023, NVIDIA CORPORATION & AFFILIATES
+# Copyright (c) 2021-2025, NVIDIA CORPORATION & AFFILIATES
 #
 # SPDX-License-Identifier: BSD-3-Clause
 
@@ -17,16 +17,16 @@
 import time
 from typing import Iterable, Optional, Union
 import warnings
-
 import cupy as cp
 import numpy as np
 import nvtx
 import psutil
 
+from .constants import LOGGER_NAME
+
 
 # set up a logger
-logger_name = "cuquantum-benchmarks"
-logger = logging.getLogger(logger_name)
+logger = logging.getLogger(LOGGER_NAME)
 
 
 def wrap_with_nvtx(func, msg):
 
@@ -1,8 +1,9 @@
-# Copyright (c) 2021-2023, NVIDIA CORPORATION & AFFILIATES
+# Copyright (c) 2021-2025, NVIDIA CORPORATION & AFFILIATES
 #
 # SPDX-License-Identifier: BSD-3-Clause
 
 from .backend_cirq import Cirq
+from .backend_cudaq import CudaqCusv, CudaqMgpu, CudaqCpu
 from .backend_cutn import cuTensorNet
 from .backend_pny import Pny, PnyLightningGpu, PnyLightningCpu, PnyLightningKokkos
 from .backend_qsim import Qsim, QsimCuda, QsimCusv, QsimMgpu
@@ -16,6 +17,9 @@
     'aer-cusv': AerCusv,
     'cusvaer': CusvAer,
     'cirq': Cirq,
+    'cudaq-cusv': CudaqCusv,
+    'cudaq-mgpu': CudaqMgpu,
+    'cudaq-cpu': CudaqCpu,
     'cutn': cuTensorNet,
     'qsim': Qsim,
     'qsim-cuda': QsimCuda,
 
@@ -0,0 +1,75 @@
+# Copyright (c) 2021-2025, NVIDIA CORPORATION & AFFILIATES
+#
+# SPDX-License-Identifier: BSD-3-Clause
+
+import functools
+import warnings
+import logging
+try:
+    import cirq
+except ImportError:
+    cirq = None
+    
+try:
+    from .. import _internal_utils
+except ImportError:
+    _internal_utils = None
+from .backend import Backend
+from ..constants import LOGGER_NAME
+
+
+# set up a logger
+logger = logging.getLogger(LOGGER_NAME)
+
+
+class _Cirq(Backend):
+
+    def __init__(self, ngpus, ncpu_threads, precision, *args, identifier=None, **kwargs):
+        if cirq is None:
+            raise RuntimeError("cirq is not installed")
+        if ngpus > 0:
+            raise ValueError("the cirq backend only runs on CPU")
+        if ncpu_threads > 1:
+            warnings.warn("cannot set the number of CPU threads for the cirq backend")
+        if precision != 'single':
+            raise ValueError("the cirq backend only supports single precision")
+
+        self.backend = cirq.Simulator()
+        self.identifier = identifier
+        self.version = cirq.__version__
+        self.meta = {}
+        self.meta['ncputhreads'] = ncpu_threads
+
+    def preprocess_circuit(self, circuit, *args, **kwargs):
+        if _internal_utils is not None:
+            _internal_utils.preprocess_circuit(self.identifier, circuit, *args, **kwargs)
+        
+        self.compute_mode = kwargs.pop('compute_mode')
+        valid_choices = ['statevector', 'sampling']
+        if self.compute_mode not in valid_choices:
+            raise ValueError(f"The '{self.compute_mode}' computation mode is not supported for this backend. Supported modes are: {valid_choices}")
+        
+        self.updated_circuit = circuit
+        if self.compute_mode == 'statevector':
+            self.updated_circuit = cirq.drop_terminal_measurements(circuit)
+            
+        self.meta['compute-mode'] = f'{self.compute_mode}()'
+        logger.info(f'data: {self.meta}')
+
+        pre_data = self.meta
+        return pre_data
+    
+    def run(self, circuit, nshots=1024):
+        if self.compute_mode == 'sampling':
+            results = self.backend.run(self.updated_circuit, repetitions=nshots)
+            samples = results.histogram(key='result')
+            post_res = results.measurements['result']
+        elif self.compute_mode == 'statevector':
+            results = self.backend.simulate(self.updated_circuit)
+            sv = results.final_state_vector
+            post_res = None
+
+        return {'results': None, 'post_results': post_res, 'run_data': {}}
+
+
+Cirq = functools.partial(_Cirq, identifier='cirq')
Original file line number	Diff line number	Diff line change
`@@ -1,4 +1,4 @@`
`1`		`-Copyright (c) 2021-2024 NVIDIA CORPORATION & AFFILIATES.`
	`1`	`+Copyright (c) 2021-2025 NVIDIA CORPORATION & AFFILIATES.`
`2`	`2`
`3`	`3`	`BSD-3-Clause`
`4`	`4`