Device invocation rewriting for zero-shot instrument retrieval

Augmenting giant language fashions (LLMs) with exterior instruments, moderately than relying solely on their inner data, might unlock their potential to resolve tougher issues. Widespread approaches for such “instrument studying” fall into two classes: (1) supervised strategies to generate instrument calling capabilities, or (2) in-context studying, which makes use of instrument paperwork that describe the supposed instrument utilization together with few-shot demonstrations. Device paperwork present directions on instrument’s functionalities and find out how to invoke it, permitting LLMs to grasp the person instruments.

Nevertheless, these strategies face sensible challenges when scaling to numerous instruments. First, they endure from enter token limits. It’s not possible to feed the complete record of instruments inside a single immediate, and, even when it had been potential, LLMs nonetheless typically wrestle to successfully course of related data from prolonged enter contexts. Second, the pool of instruments is evolving. LLMs are sometimes paired with a retriever educated on labeled question–instrument pairs to advocate a shortlist of instruments. Nevertheless, the best LLM toolkit must be huge and dynamic, with instruments present process frequent updates. Offering and sustaining labels to coach a retriever for such an intensive and evolving toolset can be impractical. Lastly, one should cope with ambiguous person intents. Consumer context within the queries might obfuscate the underlying intents, and failure to establish them might result in calling the wrong instruments.

In “Re-Invoke: Device Invocation Rewriting for Zero-Shot Device Retrieval”, offered at EMNLP 2024, we introduce a novel unsupervised retrieval technique particularly designed for instrument studying to handle these distinctive challenges. Re-Invoke leverages LLMs for each instrument doc enrichment and person intent extraction to boost instrument retrieval efficiency throughout varied use instances. We reveal that the proposed Re-Invoke technique persistently and considerably improves upon the baselines protecting each single- and multi-tool retrieval duties on instrument use benchmark datasets.