jamescooke, to random
@jamescooke@fosstodon.org

Here's a smell that your BigQuery query ain't that big: lots of 1-row tables getting used to load config-like values, which are then used for JOINs later.

So much cruft to clean up 😥

My brain says: maybe coulda just done it in pandas?! Let's definitely not learn about CTEs, right?! 😬
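
For anyone curious what the CTE route might look like, here is a minimal sketch. It uses Python's built-in `sqlite3` instead of BigQuery purely so it runs locally (that swap is an assumption, though the `WITH` clause reads much the same in both). The `events` table, its columns, and the config values are all hypothetical.

```python
import sqlite3

# In-memory database standing in for a real warehouse (assumption for the demo).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE events (user_id INTEGER, country TEXT, amount REAL);
    INSERT INTO events VALUES
        (1, 'GB', 12.0),
        (2, 'GB', 3.0),
        (3, 'DE', 40.0),
        (4, 'GB', 25.0);
""")

# The config values live in a CTE inside the query itself,
# so there is no separate 1-row table to create and clean up later.
QUERY = """
WITH config AS (
    SELECT 'GB' AS target_country, 10.0 AS min_amount
)
SELECT e.user_id, e.amount
FROM events AS e
CROSS JOIN config AS c
WHERE e.country = c.target_country
  AND e.amount >= c.min_amount
ORDER BY e.user_id
"""

rows = conn.execute(QUERY).fetchall()
print(rows)
```

The config still gets joined in, but it travels with the query rather than sitting around as a table of its own.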

siddhantgoel, to python
@siddhantgoel@mastodon.social

What's the best way to get download statistics for a specific Python package uploaded to PyPI?

I'm interested in stuff like total downloads since being published, monthly download count, etc.

hugovk,
@hugovk@mastodon.social

@sckottie @siddhantgoel The API doesn't have more, but https://github.com/hugovk/norwegianblue is my client for the https://pypistats.org API.

There's also https://www.pepy.tech which has total downloads, but it includes downloads from PyPI and from PyPI mirrors.

To go to the source, both get data from BigQuery:

https://packaging.python.org/en/latest/guides/analyzing-pypi-package-downloads/

https://cloud.google.com/blog/topics/developers-practitioners/analyzing-python-package-downloads-bigquery
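
As a rough illustration of the monthly-count part of the question, the sketch below rolls daily rows up into per-month totals. It assumes the pypistats.org `overall` endpoint returns a `data` list of `{category, date, downloads}` rows; that shape, the helper names, and the sample numbers are assumptions for the demo, so check the live API before relying on it. No network call is made here.

```python
import json
from collections import defaultdict

API_ROOT = "https://pypistats.org/api"


def overall_url(package, mirrors=False):
    """Build the URL for a package's daily download series (assumed endpoint shape)."""
    flag = "true" if mirrors else "false"
    return f"{API_ROOT}/packages/{package.lower()}/overall?mirrors={flag}"


def monthly_totals(payload):
    """Sum daily download rows into a {"YYYY-MM": count} mapping."""
    totals = defaultdict(int)
    for row in payload["data"]:
        month = row["date"][:7]  # "2023-07-01" -> "2023-07"
        totals[month] += row["downloads"]
    return dict(sorted(totals.items()))


# Made-up sample payload in the assumed response shape.
SAMPLE = json.loads("""
{
  "data": [
    {"category": "without_mirrors", "date": "2023-06-30", "downloads": 10},
    {"category": "without_mirrors", "date": "2023-07-01", "downloads": 5},
    {"category": "without_mirrors", "date": "2023-07-02", "downloads": 7}
  ],
  "package": "example-package",
  "type": "overall_downloads"
}
""")

print(overall_url("Example-Package"))
print(monthly_totals(SAMPLE))
```

In practice you would fetch the JSON from `overall_url(...)` with an HTTP client of your choice, or use a ready-made client such as the one linked above, and pass the decoded payload to `monthly_totals`.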

tdp_org, to random
@tdp_org@mastodon.social

If you use BigQuery and allow others to query your data, you might be interested in my feature request:
Disallow querying for select * ...
This is a super-simple control which is useful for folks with wide tables/columns that incur significant cost on select *.
Of course, it won't prevent someone listing every column, but it will encourage people to think about what they're doing & adjust.
https://issuetracker.google.com/u/2/issues/288391231
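
Until something like that exists server-side, a client-side check is one possible stopgap. The sketch below is a deliberately naive regex lint, not a BigQuery feature: the function name is hypothetical, it only catches a star that directly follows `SELECT` (optionally after `DISTINCT`/`ALL` or a table alias), and it does not understand comments or string literals.

```python
import re

# Matches "SELECT *", "SELECT DISTINCT *", and "SELECT t.*" (case-insensitive).
# It will not match aggregate calls such as COUNT(*).
SELECT_STAR = re.compile(
    r"\bselect\s+(?:(?:all|distinct)\s+)?(?:\w+\.)*\*",
    re.IGNORECASE,
)


def uses_select_star(sql):
    """Return True if the query text appears to select every column."""
    return SELECT_STAR.search(sql) is not None


for query in (
    "SELECT * FROM logs",
    "select l.* from logs as l",
    "SELECT COUNT(*) FROM logs",
    "SELECT ts, status FROM logs",
):
    print(uses_select_star(query), "<-", query)
```

A check like this could run in a pre-commit hook or a query wrapper; like the proposed control, it nudges rather than enforces.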

tdp_org, to webdev
@tdp_org@mastodon.social

@steren & I had a chat a while back about our serverless log processing pipeline which I built on Google Cloud using Cloud Storage, Eventarc (PubSub), Cloud Run & BigQuery. We talked a bit around scaling & other interesting stuff.
The comms fairies then wove their magic & turned it into a blog post:
https://cloud.google.com/blog/products/serverless/how-the-bbc-uses-cloud-run-and-bigquery-to-process-logs

SimoAhava, to random

New stuff coming to the export in BigQuery:

  • New fields with event-level traffic source data to make it easier to parse campaign data from individual hits

  • New export (separate table) for user-level data

Coming Q2 2023.

h/t Johan Strand

New user-level export coming to BigQuery.

nucliweb, to random
@nucliweb@webperf.social

Intro to BigQuery and HttpArchive with Rick Viscomi

📺 https://www.youtube.com/watch?v=00f9kza3BJ0
