[Dataflow non-portable runner] Add support for sending logs directly to Cloud Logging#37662
scwhittle wants to merge 1 commit into apache:master
Summary of Changes (Gemini Code Assist)
This pull request enhances Dataflow worker logging by enabling direct transmission of logs to Google Cloud Logging. The change aims to reduce log latency and improve reliability by letting logs bypass the traditional disk-based ingestion mechanism. A new configuration option provides granular control over which log levels are uploaded directly.
Codecov Report
✅ All modified and coverable lines are covered by tests.
@@ Coverage Diff @@
## master #37662 +/- ##
=========================================
Coverage 40.08% 40.08%
Complexity 3416 3416
=========================================
Files 1178 1177 -1
Lines 187433 187443 +10
Branches 3589 3592 +3
=========================================
+ Hits 75130 75144 +14
+ Misses 108912 108905 -7
- Partials 3391 3394 +3
By default, logs from Dataflow pipelines are written to disk and then uploaded to Cloud Logging by an agent on the VM. This path has a throttling limit beyond which logs are dropped. The limit is somewhat arbitrary, but it prevents excessive logging bills for customers that have per-element logging enabled, and the current implementation has its own costs and scaling limits. This PR adds options for sending logs directly to Cloud Logging, falling back to disk-based logging on errors.
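The routing described above can be sketched roughly as follows. This is a minimal illustration built only on `java.util.logging`, not the code in this PR: `DirectSink`, `FallbackHandler`, `RecordingHandler`, and the threshold parameter are hypothetical names, and the choice to send records below the threshold straight to the disk path is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.LogRecord;

/** Hypothetical sketch of "send directly, fall back to disk on error" log routing. */
public class DirectLoggingSketch {

  /** Stand-in for a direct Cloud Logging upload; not a real client API. */
  interface DirectSink {
    void send(LogRecord record) throws Exception;
  }

  /** Routes records at or above a threshold to the direct sink, otherwise to disk. */
  static final class FallbackHandler extends Handler {
    private final DirectSink directSink;
    private final Handler diskHandler;
    private final Level directThreshold;

    FallbackHandler(DirectSink directSink, Handler diskHandler, Level directThreshold) {
      this.directSink = directSink;
      this.diskHandler = diskHandler;
      this.directThreshold = directThreshold;
    }

    @Override
    public void publish(LogRecord record) {
      if (record.getLevel().intValue() >= directThreshold.intValue()) {
        try {
          directSink.send(record);
          return; // Direct upload succeeded, so the disk path is skipped.
        } catch (Exception e) {
          // Direct upload failed: fall through to the disk-based path.
        }
      }
      diskHandler.publish(record);
    }

    @Override
    public void flush() {
      diskHandler.flush();
    }

    @Override
    public void close() {
      diskHandler.close();
    }
  }

  /** In-memory handler standing in for the disk-based path, for demonstration only. */
  static final class RecordingHandler extends Handler {
    final List<String> messages = new ArrayList<>();

    @Override
    public void publish(LogRecord record) {
      messages.add(record.getMessage());
    }

    @Override
    public void flush() {}

    @Override
    public void close() {}
  }

  public static void main(String[] args) {
    RecordingHandler disk = new RecordingHandler();
    List<String> uploaded = new ArrayList<>();
    Handler handler =
        new FallbackHandler(r -> uploaded.add(r.getMessage()), disk, Level.WARNING);

    handler.publish(new LogRecord(Level.INFO, "below threshold"));
    handler.publish(new LogRecord(Level.SEVERE, "at or above threshold"));

    System.out.println("direct: " + uploaded);
    System.out.println("disk: " + disk.messages);
  }
}
```

Passing `Level.OFF` as the threshold in this sketch disables direct upload entirely, since no ordinary record level reaches it.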