For the Google Endpoint, you can either use the Generative Language API (for Gemini models), or the Vertex AI API (for Gemini, PaLM2 & Codey models).
The Generative Language API uses an API key, which you can get from Google AI Studio.
For Vertex AI, you need a Service Account JSON key file, with appropriate access configured.
Instructions for both are given below.
Generative Language API (Gemini)
See here for Gemini API pricing and rate limits
⚠️ While Google models are free, Google uses your input/output to help improve the model, with data de-identified from your Google Account and API key. ⚠️ During this period, your messages “may be accessible to trained reviewers.”
To use Gemini models through Google AI Studio, you'll need an API key. If you don't already have one, create a key in Google AI Studio.
Get an API key here: aistudio.google.com
Once you have your key, you can provide it in your .env file, which allows all users of your instance to use it.
Alternatively, you can make users provide it from the frontend by setting the following:
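For example, your .env entry might look like either of the following (the key value is a placeholder; use the key from Google AI Studio):

```bash
# Shared key: all users of the instance use this key
GOOGLE_KEY=your-api-key-here

# Or, require each user to supply their own key from the frontend
# GOOGLE_KEY=user_provided
```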
Some reverse proxies do not support the X-goog-api-key header. You can configure LibreChat to use the Authorization header instead:
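A sketch of the relevant .env setting (the `GOOGLE_AUTH_HEADER` variable name is assumed here; check your LibreChat version's environment variable reference):

```bash
# Send the key as "Authorization: Bearer <key>" instead of the X-goog-api-key header
GOOGLE_AUTH_HEADER=true
```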
Since fetching the models list isn't yet supported, you should set the models you want to use in the .env file.
For your convenience, these are the latest models as of 5/18/24 that can be used with the Generative Language API:
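A sketch of the corresponding .env entry (the model IDs below are examples from that period; substitute whichever current model IDs you want to expose):

```bash
GOOGLE_MODELS=gemini-1.5-flash-latest,gemini-1.5-pro-latest,gemini-1.0-pro,gemini-pro,gemini-pro-vision
```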
Notes:
- A `gemini-pro` model or `gemini-pro-vision` is required in your list for attaching images.
- Using LibreChat, PaLM2 and Codey models can only be accessed through Vertex AI, not the Generative Language API.
- Only models that support the `generateContent` method can be used natively with LibreChat + the Gen AI API.
- Selecting `gemini-pro-vision` for messages with attachments is not necessary, as it will be switched behind the scenes for you.
- Since `gemini-pro-vision` does not accept non-attachment messages, messages without attachments are automatically switched to use `gemini-pro` (otherwise, Google responds with an error).
- With the Google endpoint, you cannot use both Vertex AI and the Generative Language API at the same time. You must choose one or the other.
- Some PaLM/Codey models and `gemini-pro-vision` may fail when `maxOutputTokens` is set to a high value. If you encounter this issue, try reducing the value through the conversation parameters.
Setting GOOGLE_KEY=user_provided in your .env file sets both the Vertex AI Service Account JSON key file and the Generative Language API key to be provided from the frontend like so:
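The .env line in question:

```bash
GOOGLE_KEY=user_provided
```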
Vertex AI
See here for Vertex AI API pricing and rate limits
To set up Google LLMs (via Google Cloud Vertex AI), first sign up for Google Cloud: cloud.google.com
You can usually get $300 starting credit, which makes this option free for 90 days.
- Once signed up, enable the Vertex AI API on Google Cloud:
  - Go to the Vertex AI page on the Google Cloud console
  - Click on "Enable API" if prompted
- Create a Service Account with the Vertex AI role:
  - Click here to create a Service Account
  - Select or create a project
  - Enter a service account ID (required); name and description are optional
  - Click on "Create and Continue" to grant at least the "Vertex AI User" role
  - Click on "Continue/Done"
- Create a JSON key to save in your project directory:
  - Go back to the Service Accounts page
  - Select your service account
  - Click on "Keys"
  - Click on "Add Key" and then "Create new key"
  - Choose JSON as the key type and click on "Create"
  - Download the key file and rename it 'auth.json'
  - Save it within the project directory, in `/api/data/`
Alternative: Using GOOGLE_SERVICE_KEY_FILE
Instead of saving the key file to /api/data/auth.json, you can use the GOOGLE_SERVICE_KEY_FILE environment variable to specify the path to your service account key file. This provides more flexibility in how you manage your credentials. See the environment variable section below for more details.
Saving your JSON key file in the project directory allows all users of your LibreChat instance to use it.
Alternatively, you can make users provide it from the frontend by setting the following:
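The .env setting for frontend-provided credentials:

```bash
GOOGLE_KEY=user_provided
```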
You can also specify the service account key file using the GOOGLE_SERVICE_KEY_FILE environment variable:
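For example (the path is a placeholder; point it at wherever you saved your key file):

```bash
GOOGLE_SERVICE_KEY_FILE=/path/to/auth.json
```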
This is particularly useful for features that require Vertex AI authentication, such as OCR capabilities.
You can also specify the Google Cloud location for Vertex AI API requests:
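A sketch of the relevant .env entry (`us-central1` is an example region; use the region where your project runs):

```bash
GOOGLE_LOC=us-central1
```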
Since fetching the models list isn't yet supported, you should set the models you want to use in the .env file.
For your convenience, these are the latest models as of 5/18/24 that can be used with Vertex AI:
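A sketch of the corresponding .env entry (the model IDs below are examples; substitute the Vertex AI model IDs you want to expose):

```bash
GOOGLE_MODELS=gemini-1.5-pro-preview-0409,gemini-pro,gemini-pro-vision,chat-bison,text-bison,codechat-bison
```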
If you are using Docker
If you're using Docker and want to provide the auth.json file, you will also need to mount the volume in docker-compose.override.yml:
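A sketch of such an override (the service name and the in-container path `/app/api/data` are assumptions based on the default LibreChat layout; adjust to match your compose file):

```yaml
version: '3.4'

services:
  api:
    volumes:
      # Mount the host directory containing auth.json into the container
      - ./api/data:/app/api/data
```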
Google Safety Settings
To set safety settings for both Vertex AI and Generative Language API, you can set the following in your .env file:
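A sketch of these settings (variable names follow LibreChat's Google safety-setting convention; valid thresholds include `BLOCK_NONE`, `BLOCK_ONLY_HIGH`, `BLOCK_MEDIUM_AND_ABOVE`, and `BLOCK_LOW_AND_ABOVE`):

```bash
GOOGLE_SAFETY_SEXUALLY_EXPLICIT=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_HATE_SPEECH=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_HARASSMENT=BLOCK_ONLY_HIGH
GOOGLE_SAFETY_DANGEROUS_CONTENT=BLOCK_ONLY_HIGH
```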
You can also exclude safety settings by setting the following in your .env file, which will use the provider defaults. This can be helpful if you are having issues with specific safety settings.
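A sketch of the exclusion setting (the `GOOGLE_EXCLUDE_SAFETY_SETTINGS` variable name is assumed here; check your LibreChat version's environment variable reference):

```bash
# Omit safety settings from requests entirely, falling back to provider defaults
GOOGLE_EXCLUDE_SAFETY_SETTINGS=true
```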
NOTE: You do not have access to the BLOCK_NONE setting by default.
To use this restricted HarmBlockThreshold setting, you will need to either:
- (a) Get access through an allowlist via your Google account team
- (b) Switch your account type to monthly invoiced billing following these instructions: https://cloud.google.com/billing/docs/how-to/invoiced-billing
Notes:
- The Google endpoint supports all Shared Endpoint Settings via the `librechat.yaml` configuration file, including `streamRate`, `titleModel`, `titleMethod`, `titlePrompt`, `titlePromptTemplate`, and `titleEndpoint`.