Simplify contributor data pipeline by removing Google Sheets automation #626

Draft
rivaquiroga wants to merge 3 commits into master from tenzing-sheets-preprocessing

@rivaquiroga

Description

The previous contributor data generation relied on a Google Apps Script automation running inside a Google Spreadsheet. This approach had several issues:

  • Security concerns: The Google Apps Script required users to authorize access to their Google account with broad permissions (read/write spreadsheets, access external URLs). Granting these permissions to a script embedded in a shared spreadsheet is a security risk.
  • Complex multi-step process: The automation used IMPORTRANGE to pull data from external Tenzing sheets, created intermediate copies (both "full" and filtered versions), and then generated CSV export URLs for the Python script to consume
  • Fragile dependencies: The process depended on Google Sheets triggers, permissions, and caching behavior that could silently fail
  • Harder to debug: Issues in the pipeline were split between Google Apps Script and Python, making troubleshooting difficult
  • Not version controlled: The automation script lived inside Google Sheets, making it difficult to track changes, review code, or roll back issues

Changes

  • Modified tenzing.py to read directly from the TENZING SHEETS SOURCE sheet
  • The script now reads each project's Tenzing sheet directly, eliminating the need for intermediate sheets
  • Added convert_to_csv_url() function to convert Tenzing edit URLs to CSV export format
  • All data processing logic is now in a single, version-controlled Python script
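
As an illustration of the URL conversion described above, here is a minimal sketch. It is not the code from this PR: only the name convert_to_csv_url() comes from the description, while the regular expression, the default gid of 0, and the helpers read_rows() and fetch_sheet() are assumptions made for the example. It relies on the commonly used Google Sheets pattern in which a `/spreadsheets/d/<ID>/edit#gid=<GID>` link has a CSV counterpart at `/spreadsheets/d/<ID>/export?format=csv&gid=<GID>`.

```python
import csv
import io
import re
from urllib.parse import parse_qs, urlparse
from urllib.request import urlopen

# Matches the spreadsheet ID in URLs such as
# https://docs.google.com/spreadsheets/d/<ID>/edit#gid=<GID>
_SHEET_ID_RE = re.compile(r"/spreadsheets/d/([A-Za-z0-9_-]+)")


def convert_to_csv_url(edit_url: str) -> str:
    """Turn a Google Sheets edit URL into a CSV export URL (illustrative sketch)."""
    match = _SHEET_ID_RE.search(edit_url)
    if match is None:
        raise ValueError(f"Not a Google Sheets URL: {edit_url!r}")
    sheet_id = match.group(1)

    parsed = urlparse(edit_url)
    # The tab id (gid) may sit in the fragment (#gid=0) or the query (?gid=0).
    # Assumption: fall back to the first tab (gid=0) when none is given.
    gid = "0"
    for part in (parsed.fragment, parsed.query):
        values = parse_qs(part).get("gid")
        if values:
            gid = values[0]
            break
    return (
        f"https://docs.google.com/spreadsheets/d/{sheet_id}"
        f"/export?format=csv&gid={gid}"
    )


def read_rows(csv_text: str) -> list[dict[str, str]]:
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def fetch_sheet(edit_url: str) -> list[dict[str, str]]:
    """Download one sheet as CSV (hypothetical helper).

    This only works for sheets that are readable without signing in.
    """
    with urlopen(convert_to_csv_url(edit_url)) as response:
        return read_rows(response.read().decode("utf-8"))
```

Keeping the URL conversion and the CSV parsing separate from the network call means both can be checked without access to any real spreadsheet.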

A copy of the previous Google Sheet (with the automation script) has been preserved for archival purposes in the Google Drive folder "Team Credit & tenzing initiative".

The Google Apps Script automation can now be retired.

Fixes # (issue)

Type of Change

  • Content/documentation update
  • New feature
  • Bug fix
  • Breaking change

Testing

Checklist

  • Self-reviewed my changes
  • Verified links and formatting are correct
  • No new warnings or errors

Notes

@github-actions github-actions bot added the "content related" label (Relevant to website content) on Feb 8, 2026
@rivaquiroga rivaquiroga force-pushed the tenzing-sheets-preprocessing branch from a8bb211 to c57fae4 on February 8, 2026 00:58

github-actions bot commented Feb 8, 2026

✅ Spell Check Passed

No spelling issues found in this PR! 🎉


github-actions bot commented Feb 8, 2026

👍 All image files/references (if any) are in webp format, in line with our policy.

Labels: content related (Relevant to website content)