Meta’s new model, Toolformer, introduces a novel approach to overcoming the limitations of large language models (LLMs) by teaching them to call external tools via APIs. This capability lets a model access real-time information, reduce factual inaccuracies, and improve performance on low-resource languages and mathematical tasks.
[Read More]
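To make the idea concrete, here is a small sketch (not Meta’s code) of how an inline tool call embedded in model output could be detected and executed, in the spirit of Toolformer’s `[Tool(input) -> result]` annotations; the marker format and helper below are illustrative assumptions.

```python
import re

def execute_calculator_calls(text: str) -> str:
    """Replace [Calculator(expr)] markers with [Calculator(expr) -> result].

    Illustrative only: a real system would dispatch to many tools
    (QA, search, translation) rather than a single calculator.
    """
    def run(match: re.Match) -> str:
        expr = match.group(1)
        # Only allow digits and basic arithmetic before eval'ing.
        if not re.fullmatch(r"[\d\s+\-*/().]+", expr):
            return match.group(0)
        result = round(eval(expr), 2)
        return f"[Calculator({expr}) -> {result}]"

    return re.sub(r"\[Calculator\(([^)]*)\)\]", run, text)

output = "Out of 1400 participants, 400 [Calculator(400 / 1400)] passed."
print(execute_calculator_calls(output))
# -> Out of 1400 participants, 400 [Calculator(400 / 1400) -> 0.29] passed.
```

In the paper, such annotated examples are used to fine-tune the model itself, so it learns when and how to emit the API calls.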
Strategies for Reducing Costs in Large Language Model API Usage
Insights from the FrugalGPT Paper
The escalating costs associated with LLM APIs make it necessary to manage and optimize their usage without compromising performance. The following strategies, drawn from the FrugalGPT paper, can serve as an excellent guide to reducing LLM API costs.
[Read More]
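One of the paper’s strategies, the LLM cascade, can be sketched as follows; the model names, costs, and quality scorer below are placeholders, not the paper’s implementation.

```python
# Hypothetical sketch of an LLM cascade: try cheaper models first and
# escalate only when a scorer deems the answer unreliable.
from typing import Callable

def cascade(prompt: str,
            models: list[tuple[str, Callable[[str], str], float]],
            scorer: Callable[[str, str], float],
            threshold: float = 0.8) -> tuple[str, str]:
    """Return (model_name, answer) from the cheapest acceptable model."""
    for name, call, _cost in sorted(models, key=lambda m: m[2]):
        answer = call(prompt)
        if scorer(prompt, answer) >= threshold:
            return name, answer
    # If nothing passes, fall back to the most expensive model's answer.
    return name, answer

# Toy stand-ins for real API calls and a trained quality scorer.
cheap = lambda p: "maybe"
expensive = lambda p: "definitely"
score = lambda p, a: 0.9 if a == "definitely" else 0.3

name, answer = cascade("Is the sky blue?",
                       [("gpt-small", cheap, 0.1),
                        ("gpt-large", expensive, 1.0)],
                       score)
print(name, answer)  # escalates past the cheap model
```

In FrugalGPT the scorer is itself a learned model; here a hard-coded stub stands in for it to keep the sketch self-contained.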
Introduction to gRPC
Evolution of RPCs to gRPC
Evolution
Understanding how RPC protocols have evolved helps put gRPC in context.
[Read More]
OpenPrompt: A Prompt-learning Framework
Prompt Tuning
One core idea of prompt-learning is to use additional context with masked tokens to imitate the pre-training objectives of PLMs and thus better elicit the knowledge these models have learned.
Hence, the choice of PLM is crucial to the whole prompt-learning pipeline.
[Read More]
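The cloze idea behind this can be shown in plain Python (this is a minimal illustration, not the OpenPrompt API): the input is wrapped in a template whose masked slot the PLM is asked to fill, mimicking its masked-language-modelling pre-training. The template and verbalizer below are hypothetical.

```python
# Hypothetical sentiment template and verbalizer for cloze-style
# prompt-learning; a library like OpenPrompt makes these first-class
# objects, but the underlying idea is just string templating.
template = "{text} It was {mask}."
verbalizer = {"great": "positive", "terrible": "negative"}

def wrap(text: str, mask_token: str = "[MASK]") -> str:
    """Turn a raw input into a cloze-style prompt for a masked LM."""
    return template.format(text=text, mask=mask_token)

prompt = wrap("The movie was a delight from start to finish.")
print(prompt)
# The masked LM's top prediction for the mask slot (e.g. "great") would
# then be mapped through the verbalizer to the class label "positive".
```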
Structural Probing
Do word representations encode syntactic information?
Hypothesis
Does the language modelling objective implicitly encode the entire parse tree?
Can I detect a path from the root of the
[Read More]
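The structural-probe hypothesis can be sketched numerically: a learned linear map B is sought such that squared L2 distance between transformed word vectors approximates parse-tree distance. In the sketch below, B and the word representations are random placeholders rather than trained values.

```python
import numpy as np

rng = np.random.default_rng(0)
hidden_dim, probe_rank, n_words = 8, 4, 5

B = rng.normal(size=(probe_rank, hidden_dim))  # probe parameters (untrained here)
H = rng.normal(size=(n_words, hidden_dim))     # stand-in word representations

def probe_distance(h_i: np.ndarray, h_j: np.ndarray) -> float:
    """d_B(h_i, h_j)^2 = ||B (h_i - h_j)||^2, the predicted tree distance."""
    diff = B @ (h_i - h_j)
    return float(diff @ diff)

# Pairwise predicted distances; a trained probe would be fit so these
# match gold parse-tree distances between word pairs.
D = np.array([[probe_distance(H[i], H[j]) for j in range(n_words)]
              for i in range(n_words)])
print(D.shape)  # (5, 5); zero on the diagonal, symmetric
```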