
7 posts tagged with "google cloud"


Google Provider Update - December 2025

· 3 min read
Technologist and Cloud Consultant

We've released a major update to the StackQL Google provider with a new service, enhanced AI/ML capabilities, and improvements across 177 service files.

New Service: Speech-to-Text v2

The speechv2 service brings Cloud Speech-to-Text API v2 to StackQL with 6 resources:

  • recognizers: Manage speech recognition configurations with create, list, get, patch, delete, undelete, recognize, and batch_recognize methods
  • custom_classes: Create custom vocabulary classes for improved recognition accuracy
  • phrase_sets: Define phrase hints to boost recognition of specific terms
  • config: Manage location-level Speech-to-Text configuration
  • locations: Query available service locations
  • operations: Track long-running operations

Key features include support for multiple audio encodings (WAV, FLAC, MP3, OGG, WebM, MP4/AAC), translation capabilities, denoiser config, and KMS encryption support.
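
Once you have pulled the updated provider, you can explore the new service from the StackQL shell. The sketch below is illustrative only: the google.speechv2.recognizers resource identifier, the parent format, and the selected columns are assumptions based on the resource list above, so confirm them with SHOW METHODS and DESCRIBE first.

-- inspect the available methods on the new recognizers resource
SHOW METHODS IN google.speechv2.recognizers;

-- list recognizers in a project and location (identifiers are illustrative)
SELECT name, model, state
FROM google.speechv2.recognizers
WHERE parent = 'projects/my-project/locations/global';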

Vertex AI / AI Platform

The largest update in this release, with 87,000+ line changes, introduces powerful new RAG and evaluation capabilities (an exploratory sketch follows this list):

  • RAG Resources: rag_corpora, rag_files, rag_engine_config for Retrieval-Augmented Generation
  • Conversational AI: New chat resource
  • Model Evaluation: evaluation_sets and evaluation_items for systematic model assessment
  • New Resources: science, invoke, and openapi resources
  • Performance: Enhanced cache_config for caching configurations
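
As a starting point for exploring these new resources, you can inspect their fields and methods. The sketch below assumes the Vertex AI resources are exposed under the aiplatform service key; adjust the identifiers to match your provider version.

DESCRIBE EXTENDED google.aiplatform.rag_corpora;
SHOW METHODS IN google.aiplatform.rag_corpora;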

Discovery Engine

Major enhancements (50,000+ line changes) for search and conversational AI:

  • New assistants resource
  • New sitemaps resource for site search
  • New custom_models resource
  • Enhanced sessions and answers for conversational search
  • New authorized_views and authorized_view_sets for access control

Contact Center AI Insights

Quality assurance and analytics improvements (20,000+ line changes):

  • New qa_questions and qa_question_tags for quality assurance workflows
  • New analysis_rules resource
  • New segments resource
  • New authorized_views with IAM policy support
  • New datasets and views resources

BigQuery

Enhanced governance and access control (18,000+ line changes):

  • New routines_iam_policies for stored procedure/function IAM
  • Enhanced row_access_policies

Healthcare API

Expanded metrics and data mapping (15,000+ line changes):

  • New data_mapper_workspaces_iam_policies
  • Enhanced metrics: hl7_v2_store_metrics, dicom_store_metrics, series_metrics, study_metrics
  • New instances_storage_info resource

Cloud Spanner

Backup and security enhancements (14,000+ line changes):

  • New backup_schedules with IAM support
  • New databases_split_points resource
  • New database_roles with IAM policies

Cloud SQL Admin

New integration and management features (12,000+ line changes):

  • New instances_entra_id_certificate for Microsoft Entra ID integration
  • New instances_disk_shrink_config
  • New instances_latest_recovery_time

GKE On-Prem

Enhanced IAM across VMware and Bare Metal clusters (9,000+ line changes):

  • Enhanced VMware cluster resources with IAM policies
  • Enhanced Bare Metal cluster resources with IAM policies
  • New vmware_node_pools and bare_metal_node_pools with IAM

Developer Connect

Git integration improvements (3,500+ line changes):

  • New git_repository_links_git_refs resource
  • New users_self and users_access_token resources
  • New token resources: read_token, read_write_token

Text-to-Speech

Enhanced voices and text resources with new capabilities.

Get Started

Update to the latest Google provider:

stackql registry pull google
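
Once the provider is pulled, you can confirm it is available and browse its services directly from the StackQL shell:

SHOW PROVIDERS;
SHOW SERVICES IN google;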

Let us know your thoughts! Visit us and give us a star on GitHub.

Exploring the Google Cloud Asset API

· 6 min read
Technologist and Cloud Consultant

The Cloud Asset API has recently gone GA; this is an exceptionally useful service which stores the history and inventory of cloud resources in your GCP org. Using the Cloud Asset API via StackQL, you can enumerate all of the services and resources in your GCP org, including billable resources such as Cloud Storage buckets or Compute Engine instances, as well as other objects such as billing accounts, folders, projects, firewalls, service accounts and much more. All of this can be done using SQL!

Let’s start by exploring the available fields in this service:

Explore the API

Use DESCRIBE or DESCRIBE EXTENDED to see the fields available in the google.cloudasset.assets resource, as shown here:

DESCRIBE EXTENDED google.cloudasset.assets;

As you can see there is some very interesting stuff here, including where the asset fits in the organization hierarchy as well as whether the asset is included in a service perimeter.

Run some queries!

To start querying, you just need to supply a root node from which to start enumerating assets; this can be at the org, folder or project level.

A simple query to group and count all of the different types of assets in a GCP project is shown here:

SELECT assetType, COUNT(*)
FROM google.cloudasset.assets
WHERE parent = 'projects/123123123123'
GROUP BY assetType;

Or, to see the most recently deployed or modified assets, you could run:

SELECT name, updateTime
FROM google.cloudasset.assets
WHERE parent = 'organizations/12312312312'
ORDER BY updateTime DESC
LIMIT 3;

You can go nuts from here with other reports, or drill into the detail of anomalies or stray assets (one such drill-down sketch is shown below). Have fun!
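
For instance, a drill-down filtering on a single asset type could look like the following (the parent and assetType values are illustrative):

SELECT name, updateTime
FROM google.cloudasset.assets
WHERE parent = 'projects/123123123123'
AND assetType = 'storage.googleapis.com/Bucket'
ORDER BY updateTime DESC;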

Preventing Public Access for GCS Buckets

· 2 min read
Technologist and Cloud Consultant

It's easy enough for anyone to deploy a Cloud Storage bucket in Google Cloud; this can be done through the console, gcloud, Terraform or StackQL, as shown here: Deploying and Querying GCS Buckets using StackQL. It is also easy to inadvertently allow users to set public ACLs on a bucket, making its contents publicly visible. There is an easy way to prevent this from happening: Using public access prevention.

Let's work through a real-life scenario using StackQL.

Step 1: Run a query to find buckets which do not have public access prevention enforced

Run the following StackQL query from the shell or via exec:

SELECT name, 
JSON_EXTRACT(iamConfiguration, '$.publicAccessPrevention') as publicAccessPrevention
FROM google.storage.buckets
WHERE project = 'myco-terraform';
/* returns
|-----------------|------------------------|
| name            | publicAccessPrevention |
|-----------------|------------------------|
| myco-tf-nonprod | unspecified            |
|-----------------|------------------------|
| myco-tf-prod    | enforced               |
|-----------------|------------------------|
*/

We can see from the query results that the myco-tf-nonprod bucket does not have public access prevention enforced; let's fix it using StackQL.

Step 2: Configure public access prevention for a bucket

Run the following StackQL procedure to enforce public access prevention:

EXEC google.storage.buckets.patch
@bucket = 'myco-tf-nonprod'
@@json = '{
  "iamConfiguration": {
    "publicAccessPrevention": "enforced"
  }
}';

Step 3: Confirm public access prevention is enforced

Run the first query again, and you should see that the desired result is in place.

SELECT name, 
JSON_EXTRACT(iamConfiguration, '$.publicAccessPrevention') as publicAccessPrevention
FROM google.storage.buckets
WHERE project = 'myco-terraform';
/* returns
|-----------------|------------------------|
| name            | publicAccessPrevention |
|-----------------|------------------------|
| myco-tf-nonprod | enforced               |
|-----------------|------------------------|
| myco-tf-prod    | enforced               |
|-----------------|------------------------|
*/

Easy!

Enabling Logging for Google Cloud Storage Buckets and Analyzing Logs in BigQuery (Part II)

· 4 min read
Technologist and Cloud Consultant

In the previous post, we showed you how to enable usage and storage logging for GCS buckets. Now that we have enabled logging, let's load and analyze the logs using BigQuery. We will build up a data file (vars.jsonnet) as we go and show the queries step by step; at the end, we will show how to run this as one batch using StackQL.

Step 1: Create a BigQuery dataset

We will need a dataset (akin to a schema or a database in other RDBMS parlance), which is essentially a container for objects such as tables or views. The data and code to do this are shown here:

INSERT INTO google.bigquery.datasets(
projectId,
data__location,
data__datasetReference,
data__description,
data__friendlyName
)
SELECT
'{{ .projectId }}',
'{{ .location }}',
'{ "datasetId": "{{ .datasetId }}", "projectId": "{{ .projectId }}" }',
'{{ .description }}',
'{{ .friendlyName }}'
;
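
The INSERT above references several template variables from vars.jsonnet. A minimal sketch of the data file at this stage is shown below; all values are illustrative, and the file is extended with table and logs bucket keys in the following steps:

{
  projectId: 'stackql',
  location: 'US',
  datasetId: 'gcs_logs',
  description: 'GCS usage and storage logs',
  friendlyName: 'GCS Logs',
}

You could then run this statement in the same way as the later steps, for example stackql exec -i ./create_dataset.iql --iqldata ./vars.jsonnet (create_dataset.iql being a hypothetical file containing the INSERT statement above).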

Step 2: Create usage table

Let's use StackQL to create a table named usage to host the GCS usage logs. The schema for the table is defined in a file named cloud_storage_usage_schema_v0.json, which can be downloaded from the location provided; for reference, it is also shown in the Table Schema tab in the example below:

/* create_table.iql */

INSERT INTO google.bigquery.tables(
datasetId,
projectId,
data__description,
data__friendlyName,
data__tableReference,
data__schema
)
SELECT
'{{ .datasetId }}',
'{{ .projectId }}',
'{{ .table.usage.description }}',
'{{ .table.usage.friendlyName }}',
'{"projectId": "{{ .projectId }}", "datasetId": "{{ .datasetId }}", "tableId": "{{ .table.usage.tableId }}"}',
'{{ .table.usage.schema }}'
;

Run the following to execute the StackQL command with the input data shown:

stackql exec -i ./create_table.iql --iqldata ./vars.jsonnet

Step 3: Load the usage data

We have a BigQuery dataset and a table, so let's load some data. To do this we will need to create and submit a load job, which we can do by inserting into the google.bigquery.jobs resource as shown here:

/* bq_load_job.iql */

INSERT INTO google.bigquery.jobs(
projectId,
data__configuration
)
SELECT
'stackql',
'{
  "load": {
    "destinationTable": {
      "projectId": "{{ .projectId }}",
      "datasetId": "{{ .datasetId }}",
      "tableId": "{{ .table.usage.tableId }}"
    },
    "sourceUris": [
      "gs://{{ .logs_bucket }}/{{ .object_prefix }}"
    ],
    "schema": {{ .table.usage.schema }},
    "skipLeadingRows": 1,
    "maxBadRecords": 0,
    "projectionFields": []
  }
}'
;

Run the following to execute:

stackql exec -i ./bq_load_job.iql --iqldata ./vars.jsonnet
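
To confirm the load has completed, you could query the google.bigquery.jobs resource. This is a sketch only, as the status column and its $.state key are assumptions based on the BigQuery jobs API response:

SELECT id, JSON_EXTRACT(status, '$.state') as state
FROM google.bigquery.jobs
WHERE projectId = 'stackql';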

Clean up (optional)

If you want to clean up what you have done, you can do so using StackQL DELETE statements, as provided below:

NOTE: To delete a BigQuery dataset, you must first delete all of the tables contained in the dataset, as shown in the following example

-- delete table(s) 

DELETE FROM google.bigquery.tables
WHERE projectId = '{{ .projectId }}'
AND datasetId = '{{ .datasetId }}'
AND tableId = '{{ .table.usage.tableId }}';

-- delete dataset

DELETE FROM google.bigquery.datasets
WHERE projectId = '{{ .projectId }}'
AND datasetId = '{{ .datasetId }}';

Enabling Logging for Google Cloud Storage Buckets and Analyzing Logs in BigQuery (Part I)

· 3 min read
Technologist and Cloud Consultant

In a previous article, Deploying and Querying GCS Buckets using StackQL, we walked through some basic creation and query operations on Google Cloud Storage buckets. In this post we will build on this by enabling logging on a GCS bucket using StackQL. This post is based upon this article: Usage logs & storage logs.

Assuming we have deployed a bucket which we want to log activities on, follow the steps below:

Step 1: Create a bucket to store the usage logs

One bucket in a project can be used to collect the usage logs from one or more other buckets in the project. Use the StackQL Command Shell (stackql shell) or stackql exec to create this logs bucket as shown here:

INSERT INTO google.storage.buckets(
project,
data__name,
data__location,
data__locationType
)
SELECT
'stackql',
'stackql-download-logs',
'US',
'multi-region'
;

For more examples of creating Google Cloud Storage buckets using StackQL, see Deploying and Querying GCS Buckets using StackQL.

Step 2: Set IAM policy for the logs bucket

You will need to create an IAM binding to enable writes to this bucket; do this using the setIamPolicy method as shown here:

EXEC google.storage.buckets.setIamPolicy
@bucket = 'stackql-download-logs'
@@json = '{
  "bindings": [
    {
      "role": "roles/storage.legacyBucketWriter",
      "members": [
        "group:cloud-storage-analytics@google.com"
      ]
    }
  ]
}';

TIP: You should also add role bindings for the roles/storage.legacyBucketOwner role for service accounts or users who will be running StackQL SELECT queries against this logs bucket.
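
To verify that the binding has been applied, you could call the corresponding getIamPolicy method. This is a sketch assuming the method is exposed as google.storage.buckets.getIamPolicy (run SHOW METHODS IN google.storage.buckets; to confirm):

EXEC google.storage.buckets.getIamPolicy
@bucket = 'stackql-download-logs';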

Step 3: Enable logging on the target bucket

To enable logging on your target bucket (or buckets) run the following StackQL EXEC method:

EXEC google.storage.buckets.patch
@bucket = 'stackql-downloads'
@@json = '{
  "logging": {
    "logBucket": "stackql-download-logs",
    "logObjectPrefix": "stackql_downloads"
  }
}';

TIP: Use SHOW METHODS IN google.storage.buckets; to see what operations are available, such as the patch and setIamPolicy examples shown in the previous steps.

Step 4: Check logging status on target bucket

To see that logging has been enabled, run the StackQL query below:

SELECT name, logging
FROM google.storage.buckets
WHERE project = 'stackql'
AND logging IS NOT NULL;

To unpack the logging object, you can use the [JSON_EXTRACT](/docs/language-spec/functions/json/json_extract) built-in function as shown here:

SELECT name,
JSON_EXTRACT(logging, '$.logBucket') as logBucket,
JSON_EXTRACT(logging, '$.logObjectPrefix') as logObjectPrefix
FROM google.storage.buckets
WHERE project = 'stackql'
AND logging IS NOT NULL;

In Part II of this post, we will demonstrate how to create a BigQuery dataset, then load and analyze the GCS usage logs you have collected using BigQuery. Stay tuned!