HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models. The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS'24), 2024. [PUMA: AutoML large_language_models neural_architecture_search]