| Name | Date | Size | #Lines | LOC |
|------|------|------|--------|-----|
| README.md | 25-Apr-2025 | 1.2 KiB | 42 | 26 |
| __init__.py | 25-Apr-2025 | 196 B | 11 | 7 |
| bench.py | 25-Apr-2025 | 10.6 KiB | 361 | 289 |
| cells.py | 25-Apr-2025 | 3.6 KiB | 142 | 99 |
| conftest.py | 25-Apr-2025 | 962 B | 35 | 27 |
| custom_lstms.py | 25-Apr-2025 | 17 KiB | 511 | 394 |
| factory.py | 25-Apr-2025 | 17.2 KiB | 534 | 410 |
| fuser.py | 25-Apr-2025 | 1.4 KiB | 37 | 32 |
| profile.py | 25-Apr-2025 | 4.5 KiB | 172 | 135 |
| runner.py | 25-Apr-2025 | 3 KiB | 110 | 90 |
| scratch.py | 25-Apr-2025 | 1 KiB | 54 | 35 |
| test.py | 25-Apr-2025 | 5.8 KiB | 183 | 141 |
| test_bench.py | 25-Apr-2025 | 1.6 KiB | 57 | 42 |

README.md

# Fast RNN benchmarks

Benchmarks for TorchScript models

For the most stable results, do the following (a setup sketch follows this list):
- Set the CPU governor to performance mode (as opposed to an energy-saving mode)
- Turn off turbo for all CPUs (assuming Intel CPUs)
- Shield CPUs via `cset shield` when running benchmarks.
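
For example, on a Linux machine with an Intel CPU the setup might look like the
sketch below. This is only a sketch: the exact tools, sysfs paths, and CPU
numbers depend on your distribution and CPU frequency driver.

```sh
# Switch every core to the performance governor (instead of an energy-saving one).
sudo cpupower frequency-set --governor performance

# Disable turbo boost (this knob is specific to the intel_pstate driver).
echo 1 | sudo tee /sys/devices/system/cpu/intel_pstate/no_turbo

# Reserve some cores for benchmarking and move kernel threads off of them.
# The CPU range 2-5 is just an example; pick cores that suit your machine.
sudo cset shield --cpu 2-5 --kthread=on
```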

Some of these scripts accept command line args, but most of them do not yet.
Command line options will probably be added in the future; in the meantime the
default sizes are reasonable.
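
To see which options a given script does accept, the scripts that take
arguments expose the usual `--help` flag (this assumes they parse their
arguments with Python's `argparse`, which generates `--help` automatically):

```sh
# Print the accepted command line options for each script.
python -m fastrnns.bench --help
python -m fastrnns.test --help
python -m fastrnns.profile --help
```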

## Test fastrnns (fwd + bwd) correctness

Test the fastrnns benchmarking scripts with the following:
`python -m fastrnns.test`
or run the test for specific rnn types independently:
`python -m fastrnns.test --rnns jit`

## Run benchmarks

`python -m fastrnns.bench`

should give a good comparison, or you can specify the type of model to run:

`python -m fastrnns.bench --rnns cudnn aten jit --group rnns`
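
For the most stable numbers, the benchmark can be run inside the CPU shield set
up earlier. A sketch, assuming the `cset` setup from above (the `--user` value
and the exact argument order may differ with your `cset` version):

```sh
# Run the benchmark on the shielded CPUs; --user avoids running the
# benchmark itself as root, and everything after -- is passed to python.
sudo cset shield --user=$USER --exec python -- -m fastrnns.bench --rnns cudnn aten jit --group rnns
```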

## Run model profiling (calls nvprof)

`python -m fastrnns.profile`

should generate nvprof files for all of the models somewhere.
You can also generate nvprof files for specific models separately:

`python -m fastrnns.profile --rnns aten jit`
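
Once a profile has been captured, the resulting file can be inspected with the
regular nvprof tooling (assuming a CUDA toolkit old enough to still ship
`nvprof`). The file name below is only a placeholder; use whatever file the
script actually produced:

```sh
# Print a summary of a previously captured profile.
nvprof --import-profile some_model.nvprof
```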

### Caveats

Use Linux for the most accurate timing. A lot of these tests only run
on CUDA.