Skip to main content

GCP Vertex AI

Note: This is separate from the Google PaLM integration, it exposes Vertex AI PaLM API on Google Cloud.

Setting up

By default, Google Cloud does not use customer data to train its foundation models as part of Google Cloud's AI/ML Privacy Commitment. More details about how Google processes data can also be found in Google's Customer Data Processing Addendum (CDPA).

To use Vertex AI PaLM you must have the google-cloud-aiplatform Python package installed and either:

  • Have credentials configured for your environment (gcloud, workload identity, etc...)
  • Store the path to a service account JSON file as the GOOGLE_APPLICATION_CREDENTIALS environment variable

This codebase uses the google.auth library which first looks for the application credentials variable mentioned above, and then looks for system-level auth.

For more information, see:

#!pip install langchain google-cloud-aiplatform
from langchain.llms import VertexAI

API Reference:

llm = VertexAI()
print(llm("What are some of the pros and cons of Python as a programming language?"))
     Python is a widely used, interpreted, object-oriented, and high-level programming language with dynamic semantics, used for general-purpose programming. It is known for its readability, simplicity, and versatility. Here are some of the pros and cons of Python:

**Pros:**

- **Easy to learn:** Python is known for its simple and intuitive syntax, making it easy for beginners to learn. It has a relatively shallow learning curve compared to other programming languages.

- **Versatile:** Python is a general-purpose programming language, meaning it can be used for a wide variety of tasks, including web development, data science, machine

Using in a chain

from langchain.prompts import PromptTemplate

API Reference:

template = """Question: {question}

Answer: Let's think step by step."""
prompt = PromptTemplate.from_template(template)
chain = prompt | llm
question = "Who was the president in the year Justin Beiber was born?"
print(chain.invoke({"question": question}))
     Justin Bieber was born on March 1, 1994. Bill Clinton was the president of the United States from January 20, 1993, to January 20, 2001.
The final answer is Bill Clinton

Code generation example

You can now leverage the Codey API for code generation within Vertex AI.

The model names are:

  • code-bison: for code suggestion
  • code-gecko: for code completion
llm = VertexAI(model_name="code-bison", max_output_tokens=1000, temperature=0.3)
question = "Write a python function that checks if a string is a valid email address"
print(llm(question))
    ```python
import re

def is_valid_email(email):
pattern = re.compile(r"[^@]+@[^@]+\.[^@]+")
return pattern.match(email)
```

Full generation info

We can use the generate method to get back extra metadata like safety attributes and not just text completions

result = llm.generate([question])
result.generations
    [[GenerationChunk(text='```python\nimport re\n\ndef is_valid_email(email):\n    pattern = re.compile(r"[^@]+@[^@]+\\.[^@]+")\n    return pattern.match(email)\n```', generation_info={'is_blocked': False, 'safety_attributes': {'Health': 0.1}})]]

Asynchronous calls

With agenerate we can make asynchronous calls

# If running in a Jupyter notebook you'll need to install nest_asyncio

# !pip install nest_asyncio
import asyncio
# import nest_asyncio
# nest_asyncio.apply()
asyncio.run(llm.agenerate([question]))
    LLMResult(generations=[[GenerationChunk(text='```python\nimport re\n\ndef is_valid_email(email):\n    pattern = re.compile(r"[^@]+@[^@]+\\.[^@]+")\n    return pattern.match(email)\n```', generation_info={'is_blocked': False, 'safety_attributes': {'Health': 0.1}})]], llm_output=None, run=[RunInfo(run_id=UUID('caf74e91-aefb-48ac-8031-0c505fcbbcc6'))])

Streaming calls

With stream we can stream results from the model

import sys
for chunk in llm.stream(question):
sys.stdout.write(chunk)
sys.stdout.flush()
    ```python
import re

def is_valid_email(email):
"""
Checks if a string is a valid email address.

Args:
email: The string to check.

Returns:
True if the string is a valid email address, False otherwise.
"""

# Check for a valid email address format.
if not re.match(r"^[A-Za-z0-9\.\+_-]+@[A-Za-z0-9\._-]+\.[a-zA-Z]*$", email):
return False

# Check if the domain name exists.
try:
domain = email.split("@")[1]
socket.gethostbyname(domain)
except socket.gaierror:
return False

return True
```

Vertex Model Garden

Vertex Model Garden exposes open-sourced models that can be deployed and served on Vertex AI. If you have successfully deployed a model from Vertex Model Garden, you can find a corresponding Vertex AI endpoint in the console or via API.

from langchain.llms import VertexAIModelGarden

API Reference:

llm = VertexAIModelGarden(
project="YOUR PROJECT",
endpoint_id="YOUR ENDPOINT_ID"
)
print(llm("What is the meaning of life?"))

Like all LLMs, we can then compose it with other components:

prompt = PromptTemplate.from_template("What is the meaning of {thing}?")
chian = prompt | llm
print(chain.invoke({"thing": "life"}))