PipelineProfile
- class lib.infer.profile.PipelineProfile(plugins: list[ExtractPlugin], error_state: ErrorState, channels_last: list[bool], warmup_time: int, run_time: int, has_detector: bool, face_scaling: int = 2, max_vram: int = 90)
Bases:
objectBenchmark multiple PyTorch GPU plugins running simultaneously for inference
- Parameters:
plugins (list[ExtractPlugin]) – The plugins to benchmark for inference
error_state (ErrorState) – The global FSThread error state object for the pipeline
channels_last (list[bool]) – List indicating whether each model is channels first or last
warmup_time (int) – The amount of time, in seconds, to warmup the plugin at each batch size
run_time (int) – The amount of time, in seconds, to benchmark the plugin at each batch size
has_detector (bool) –
Trueif the first plugin in the pipeline is a detectorface_scaling (int) – The amount of scaling to apply to downstream plugins (ie estimate of average number of faces per frame). Default: 2
max_vram (int) – The maximum percentage of total VRAM to allow Cuda to reserve when profiling, Default: 90
Methods Summary
__call__()Runs benchmarking through all plugins concurrently, store the data and output stats
Methods Documentation
- __call__() None
Runs benchmarking through all plugins concurrently, store the data and output stats
- Return type:
None
- __call__() None
Runs benchmarking through all plugins concurrently, store the data and output stats
- Return type:
None