~hackernews | Bookmarks (207)
-
Energy-Efficient Llama 2 Inference on FPGAs via High Level Synthesis
Abstract: Graphics Processing Units (GPUs) have become the leading hardware accelerator...
-
No "Zero-Shot" Without Exponential Data
Abstract: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation...
-
Mamba-360: Survey of State Space Models as Transformer Alternative
Abstract: Sequence modeling is a crucial area across various...
-
Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Abstract: Large language models (LLMs) have shown remarkable performance...
-
The Feasibility of Implementing Large-Scale Transformers on Multi-FPGA Platforms
Abstract: FPGAs are rarely mentioned when discussing the implementation of large machine learning...
-
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
Abstract: Structured data sources, such as tables, graphs, and...