I think at some point there were more usages in the tests. But now it
is pretty easy.
PriorityLb.ChildLbState.picker is initialized to
FixedResultPicker(NoResult). So now that GracefulSwitchLb is using the
same picker, equals() is able to de-dup an update.
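A simplified sketch of why sharing the picker enables de-duping: a picker with value equality, modeled on `FixedResultPicker` (details assumed):

```java
import io.grpc.LoadBalancer.PickResult;
import io.grpc.LoadBalancer.PickSubchannelArgs;
import io.grpc.LoadBalancer.SubchannelPicker;

// Two pickers wrapping equal results compare equal, so a repeated
// updateBalancingState() with an identical picker can be skipped.
final class FixedResultPicker extends SubchannelPicker {
  private final PickResult result;

  FixedResultPicker(PickResult result) {
    this.result = result;
  }

  @Override
  public PickResult pickSubchannel(PickSubchannelArgs args) {
    return result;
  }

  @Override
  public boolean equals(Object o) {
    return o instanceof FixedResultPicker
        && result.equals(((FixedResultPicker) o).result);
  }

  @Override
  public int hashCode() {
    return result.hashCode();
  }
}
```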
Previously it would wait for the new LB to enter READY. However, that
prevented there from being an upper bound on how long the old policy would
continue to be used. The point of graceful switch is to avoid RPCs
seeing increased latency when we swap config. We don't want it to
prevent the system from becoming eventually consistent.
Some clients watching health status can cancel their watch, and `HealthService` was getting a CANCELLED exception when trying to notify those watchers because no cancellation handler was set on the `StreamObserver`. This change sets a cancellation handler that removes the watcher from the set of watchers to be notified of the health status.
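A rough sketch of the fix (the watcher bookkeeping names are hypothetical; `setOnCancelHandler` is the real gRPC API):

```java
import io.grpc.stub.ServerCallStreamObserver;
import io.grpc.stub.StreamObserver;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

final class WatcherSet<T> {
  private final Set<StreamObserver<T>> watchers = ConcurrentHashMap.newKeySet();

  void add(StreamObserver<T> responseObserver) {
    ServerCallStreamObserver<T> serverObserver =
        (ServerCallStreamObserver<T>) responseObserver;
    // Without this handler, onNext() on a cancelled watcher throws a
    // CANCELLED StatusRuntimeException.
    serverObserver.setOnCancelHandler(() -> watchers.remove(responseObserver));
    watchers.add(responseObserver);
  }

  void broadcast(T update) {
    for (StreamObserver<T> watcher : watchers) {
      watcher.onNext(update);
    }
  }
}
```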
panic() calls a good amount of code, so it could get another exception.
The SynchronizationContext is running on an arbitrary thread and we
don't want to propagate this secondary exception up its stack (to be
handled by its UncaughtExceptionHandler); if we wanted that, we'd
propagate the original exception.
This second exception will only be seen in the logs; the first exception
was logged and will be used to fail RPCs.
Also related to http://yaqs/8493785598685872128 and b692b9d26
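A sketch of the containment (structure assumed, not the actual code):

```java
import java.util.logging.Level;
import java.util.logging.Logger;

final class PanicGuard {
  private static final Logger logger = Logger.getLogger(PanicGuard.class.getName());

  void handleUncaught(Throwable originalException, Runnable panic) {
    try {
      panic.run(); // panic() runs a good amount of code and may itself throw
    } catch (Throwable secondary) {
      // The original exception was already logged and will fail RPCs; the
      // secondary one is only visible in the logs and is not propagated up
      // the SynchronizationContext's arbitrary calling thread.
      logger.log(Level.SEVERE, "Exception while panicking", secondary);
    }
  }
}
```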
This is to support gRFC A83 xDS GCP Authentication Filter:
> Otherwise, the filter will look in the CDS resource's metadata for a
> key corresponding to the filter's instance name.
It is much harder to debug refcounting problems when we ignore
impossible situations. So make such impossible cases complain loudly so
the bug is obvious.
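An illustrative sketch of the idea, with hypothetical names:

```java
final class RefCounted {
  private int refCount;

  void retain() {
    refCount++;
  }

  void release() {
    // Previously an "impossible" extra release could be silently ignored,
    // hiding the bug. Complain loudly instead.
    if (refCount <= 0) {
      throw new IllegalStateException("refcount underflow: " + refCount);
    }
    if (--refCount == 0) {
      close();
    }
  }

  private void close() {}
}
```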
This allows Filters to access the xds configuration for their own
processing. From gRFC A83:
> This data is available via the XdsConfig attribute introduced in A74.
> If the xDS ConfigSelector is not already passing that attribute to the
> filters, it will need to be changed to do so.
Contributes to the gRFC A74 effort.
https://github.com/grpc/proposal/blob/master/A74-xds-config-tears.md
The alternative to using Mockito's ArgumentMatcher is to use Hamcrest.
However, Hamcrest did not impress me. ArgumentMatcher is trivial if you
don't care about the error message.
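For example, a lambda-based ArgumentMatcher against a hypothetical interface:

```java
import static org.mockito.ArgumentMatchers.argThat;
import static org.mockito.Mockito.mock;
import static org.mockito.Mockito.verify;

import java.util.List;

class ArgumentMatcherExample {
  interface ClusterWatcher {
    void onUpdate(List<String> clusters);
  }

  void example() {
    ClusterWatcher watcher = mock(ClusterWatcher.class);
    watcher.onUpdate(List.of("cluster-a", "cluster-b"));
    // Trivial to write; the trade-off is that a failure only prints the
    // lambda's toString() rather than a descriptive message.
    verify(watcher).onUpdate(argThat(clusters -> clusters.contains("cluster-a")));
  }
}
```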
This fixes a pre-existing issue where ConfigSelector.releaseCluster
could revert the LB config back to using the cluster manager after all
RPCs using a cluster have committed.
Co-authored-by: Larry Safran <lsafran@google.com>
The logs are confusing/harder to read when the
activations/deactivations caused by the config happen before the log
entry describing the new config.
Listener2.onResult() doesn't require running in the sync context, so
when called from the sync context it is guaranteed not to do its
processing immediately (instead, it schedules work into the sync
context).
The code was doing an update dance: 1) update service config to add new
cluster, 2) update config selector to use new cluster, 3) update service
config to remove old clusters. But the onResult() wasn't being processed
immediately, so the actual execution order was 2, 1, 3 which has a small
window where RPCs will fail. But onResult2() does run immediately. And
since ca4819ac6, updateBalancingState() updates the picker immediately.
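A sketch of the intended dance, with hypothetical helpers standing in for the real plumbing:

```java
import java.util.ArrayList;
import java.util.List;

final class ClusterSwap {
  void swapTo(String newCluster, List<String> oldClusters) {
    List<String> both = new ArrayList<>(oldClusters);
    both.add(newCluster);
    updateServiceConfig(both);                // 1) add the new cluster
    updateConfigSelector(newCluster);         // 2) route RPCs to it
    updateServiceConfig(List.of(newCluster)); // 3) remove the old clusters
    // With onResult(), step 2's processing was queued into the sync context
    // and could land before step 1, so RPCs could briefly select a cluster
    // the service config didn't contain. onResult2() processes inline.
  }

  void updateServiceConfig(List<String> clusters) {}

  void updateConfigSelector(String cluster) {}
}
```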
cleanUpRoutes() was also racy because it updated the routingConfig
before swapping to the new config selector, so RPCs could fail saying
there was no route instead of the useful error message. Even with the
opposite order, some RPCs may be executing the while loop of
selectConfig(), trying to acquire a cluster. The code unreffed the
clusters before updating the routingConfig, so those RPCs could go into
a tight loop until the routingConfig was updated. Also, once the
routingConfig was updated to EMPTY those RPCs would similarly
see the wrong error message. To give the correct error message,
selectConfig() must fail such RPCs directly, and once it can do that
there's no need to stop using the config selector in error cases. This
has the benefit of fewer moving parts and more consistent threading
among cases.
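A simplified sketch of the resulting shape of selectConfig(), with assumed types and names:

```java
import io.grpc.Status;

// The config selector stays installed even in error cases, and
// selectConfig() fails RPCs directly with the useful status instead of a
// generic "no route" error.
final class SelectorSketch {
  static final class RoutingConfig {
    final Status errorStatus; // set once routes are cleaned up
    final String cluster;     // null when no route matches
    RoutingConfig(Status errorStatus, String cluster) {
      this.errorStatus = errorStatus;
      this.cluster = cluster;
    }
  }

  volatile RoutingConfig routingConfig = new RoutingConfig(null, "cluster-a");

  Status selectConfig() {
    while (true) {
      RoutingConfig config = routingConfig;
      if (config.errorStatus != null) {
        return config.errorStatus; // the correct error message, directly
      }
      if (config.cluster == null) {
        return Status.UNAVAILABLE.withDescription("Could not find route");
      }
      if (tryAcquireClusterRef(config.cluster)) {
        return Status.OK; // proceed with this cluster
      }
      // The cluster ref was released concurrently; re-read routingConfig,
      // which is updated before refs are released, avoiding a tight loop.
    }
  }

  private boolean tryAcquireClusterRef(String cluster) {
    return true; // stand-in for refcounted cluster acquisition
  }
}
```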
The added test was able to detect the race 2% of the time. The slower
the code/machine, the more reliably the test failed. ca4819ac6 along
with this commit reduced it to 0 failures in 1000 runs.
Discovered when investigating b/394850611
ffcc360ba adjusted updateBalancingState() to require being run within
the sync context. However, it still queued the work into the sync
context, which was unnecessary. Re-entering the sync context
unnecessarily delays the new state from being used.
This PR adds support for filter state retention in Java. The mechanism
will be similar to the one described in
[A83](https://github.com/grpc/proposal/blob/master/A83-xds-gcp-authn-filter.md#filter-call-credentials-cache)
for C-core, and will serve the same purpose. However, the
implementation details are very different due to the different nature
of xDS HTTP filter support in C-core and Java.
### Filter instance lifecycle
#### xDS gRPC clients
New filter instances are created per combination of:
1. `XdsNameResolver` instance,
2. Filter name+typeUrl as configured in
HttpConnectionManager (HCM) http_filters.
Existing client-side filter instances are shut down:
- A single filter instance is shut down when an LDS update contains an
  HCM that is missing the filter configuration for the name+typeUrl
  combination of this instance.
- All filter instances are shut down when the watched LDS resource is
  missing from an LDS update.
- All filter instances are shut down on name resolver shutdown.
#### xDS-enabled gRPC servers
New filter instances are created per combination of:
1. Server instance,
2. FilterChain name,
3. Filter name+typeUrl as configured in FilterChain's HCM.http_filters.
Filter instances of the default filter chain are tracked separately per:
1. Server instance,
2. Filter name+typeUrl in default_filter_chain's HCM.http_filters.
Existing server-side filter instances are shut down:
- A single filter instance is shut down when an LDS update contains a
  FilterChain with HCM.http_filters that is missing the configuration
  for the filter name+typeUrl.
- All filter instances associated with a FilterChain are shut down when
  an LDS update no longer contains that FilterChain's name.
- All filter instances are shut down when the watched LDS resource is
  missing from an LDS update.
- All filter instances are shut down on server shutdown.
### Related
- Part 1: #11883
`XdsServerWrapper#generatePerRouteInterceptors` was always intended
to be executed within a sync context. This PR ensures that by calling
`syncContext.throwIfNotInThisSynchronizationContext()`.
This change is needed for upcoming xDS filter state retention because
the new tests in XdsServerWrapperTest flake with this NPE:
> `Cannot invoke "io.grpc.xds.client.XdsClient$ResourceWatcher.onChanged(io.grpc.xds.client.XdsClient$ResourceUpdate)" because "this.ldsWatcher" is null`
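A minimal illustration of the guard; the surrounding method and setup are hypothetical, but `throwIfNotInThisSynchronizationContext()` is the real API:

```java
import io.grpc.SynchronizationContext;

final class SyncGuarded {
  private final SynchronizationContext syncContext =
      new SynchronizationContext((thread, error) -> {
        throw new RuntimeException(error);
      });

  void generatePerRouteInterceptors() {
    // Throws IllegalStateException when not already running inside
    // syncContext, turning a silent race into an obvious failure.
    syncContext.throwIfNotInThisSynchronizationContext();
    // ... read and combine state owned by syncContext
  }
}
```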
We want to move away from handleResolvedAddresses(). These are "easy" in
that they need no logic. LBs extending ForwardingLoadBalancer had the
method duplicated from handleResolvedAddresses() and swapped away from
`super` because ForwardingLoadBalancer only forwards
handleResolvedAddresses() reliably today. Duplicating small methods was
less bug-prone than dealing with ForwardingLoadBalancer.
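The duplicated method is tiny; an illustrative subclass (assuming the Status-returning `acceptResolvedAddresses()` used in this codebase):

```java
import io.grpc.LoadBalancer;
import io.grpc.Status;
import io.grpc.util.ForwardingLoadBalancer;

// Each affected LB duplicates this small method rather than relying on
// `super`, since ForwardingLoadBalancer only forwards
// handleResolvedAddresses() reliably today.
abstract class ExampleLb extends ForwardingLoadBalancer {
  @Override
  public Status acceptResolvedAddresses(LoadBalancer.ResolvedAddresses resolvedAddresses) {
    return delegate().acceptResolvedAddresses(resolvedAddresses);
  }
}
```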
Implements [gRFC A76: explicitly setting the request hash key for the
ring hash LB policy][A76]
* Explicitly setting the request hash key is guarded by the
`GRPC_EXPERIMENTAL_RING_HASH_SET_REQUEST_HASH_KEY` environment
variable until the API is stabilized.
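A sketch of the pick-time behavior described in A76, with assumed names (a stand-in hash replaces the ring hash's real hash function):

```java
import io.grpc.Metadata;
import java.util.concurrent.ThreadLocalRandom;

final class RequestHashSketch {
  // Hash the configured request header when present; otherwise generate a
  // random hash per pick so RPCs spread uniformly across the ring.
  static long requestHash(Metadata headers, String requestHashHeader) {
    Metadata.Key<String> key =
        Metadata.Key.of(requestHashHeader, Metadata.ASCII_STRING_MARSHALLER);
    String value = headers.get(key);
    if (value != null) {
      return value.hashCode(); // stand-in for the real hash function
    }
    return ThreadLocalRandom.current().nextLong();
  }
}
```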
Tested:
* Verified end-to-end by spinning up multiple gRPC servers and a gRPC
client that injects a custom service (load balancing) config with
`ring_hash_experimental` and a custom `request_hash_header` (with
NO associated value in the metadata headers) which generates a random
hash for each request to the ring hash LB. Verified picks/RPCs are
split evenly/uniformly across all backends.
* Ran affected unit tests with thread sanitizer and 1000 iterations to
check for data races.
[A76]: https://github.com/grpc/proposal/blob/master/A76-ring-hash-improvements.md#explicitly-setting-the-request-hash-key
This is the first step towards supporting filter state retention in
Java. The mechanism will be similar to the one described in
[A83](https://github.com/grpc/proposal/blob/master/A83-xds-gcp-authn-filter.md#filter-call-credentials-cache)
for C-core, and will serve the same purpose. However, the
implementation details are very different due to the different nature
of xDS HTTP filter support in C-core and Java.
In Java, xDS HTTP filters are backed by classes implementing
`io.grpc.xds.Filter`, from here just called "Filters". To support
Filter state retention (next PR), Java's xDS implementation must be
able to create unique Filter instances per:
- HCM
  `envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager`
- Filter name as specified in
  `envoy.extensions.filters.network.http_connection_manager.v3.HttpFilter.name`
This PR **does not** implement Filter state retention, but lays the
groundwork for it by changing how filters are registered and
instantiated. To achieve this, all existing Filter classes had to be
updated to the new instantiation mechanism described below.
Prior to this PR, Filters had no lifecycle. FilterRegistry
provided singleton instances for a given typeUrl. This PR introduces
a new interface `Filter.Provider`, which instantiates Filter classes.
All functionality that doesn't need an instance of a Filter is moved
to the Filter.Provider. This includes parsing filter config proto
into FilterConfig and determining the filter kind
(client-side, server-side, or both).
This PR is limited to refactoring, and there are no changes to the
existing behavior. Note that all Filter Providers still return
singleton Filter instances. However, with this PR, it is now possible
to create Providers that return a new Filter instance each time
`newInstance` is called.
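A condensed sketch of the new shape (simplified signatures; the real `Filter.Provider` differs):

```java
// The Provider owns everything that doesn't need a Filter instance;
// newInstance() is the hook a later PR can use to return fresh
// (non-singleton) Filters.
interface Filter {
  interface Provider {
    // Filter config proto types this provider handles.
    String[] typeUrls();

    // The filter kind: client-side, server-side, or both.
    boolean isClientFilter();

    boolean isServerFilter();

    // Parses the raw config proto into this filter's FilterConfig
    // representation (simplified to Object here).
    Object parseFilterConfig(com.google.protobuf.Message rawProtoMessage);

    // Today all providers still return a singleton; the interface permits
    // a new instance per call.
    Filter newInstance(String name);
  }
}
```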
A failing Status from acceptResolvedAddresses means something is wrong
with the config, but parts of the config may still have been applied.
Thus there are now two possible flows: errors that should prevent
updateOverallBalancingState() and errors that should have no effect
other than the return code. To manage that, MultiChildLb must always be
responsible for calling updateOverallBalancingState().
acceptResolvedAddressesInternal() was inlined to make that error
processing easier. No existing usages actually needed to have logic
between updating the children and regenerating the picker.
RingHashLb already was verifying that the address list was not empty, so
the short-circuiting when acceptResolvedAddressesInternal() returned an
error was impossible to trigger. WrrLb's updateWeightTask() calls the
last picker, so it can run before acceptResolvedAddressesInternal(); the
only part that matters is re-creating the weightUpdateTimer.
S2AStub is an internal API and shouldn't be used outside of s2a. It is
still available for tests.
IntegrationTest was moved to io.grpc.s2a. It uses an io.grpc.s2a class,
so it shouldn't be in internal.handler.
Switched to using 8192, which is the current value of Segment.SIZE, and added a test checking that they are equal.
The reason for doing this is that Segment.SIZE is Kotlin-internal, so it shouldn't be used outside of its module.
To try to address this failure when building android-interop-testing:
```
The Daemon will expire after the build after running out of JVM heap space.
The project memory settings are likely not configured or are configured to an insufficient value.
The daemon will restart for the next build, which may increase subsequent build times.
These settings can be adjusted by setting 'org.gradle.jvmargs' in 'gradle.properties'.
The currently configured max heap space is '512 MiB' and the configured max metaspace is '384 MiB'.
...
Exception in thread "Daemon client event forwarder" java.lang.OutOfMemoryError: Java heap space
...
> Task :grpc-android-interop-testing:mergeDexDebug FAILED
ERROR:D8: java.lang.OutOfMemoryError: Java heap space
com.android.builder.dexing.DexArchiveMergerException: Error while merging dex archives:
```
* Don't process resourceDoesNotExist for watchers that have been cancelled.
* Change the test to use an ArgumentMatcher instead of expecting that only the final result will be sent, since depending on timing there may be configs sent for clusters being removed, with their entries as errors.
Currently this improves 2 flows:
1. Known-length messages whose length is greater than 1 MiB. Previously the
first buffer was 1 MiB, and then many buffers of 4096 bytes (from
CodedOutputStream); now subsequent buffers are also up to 1 MiB.
2. In the case of compression, the first write is always a 10-byte buffer
(the gzip header), but it is worth allocating more space.
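A sketch of the sizing rule, with an assumed constant and names:

```java
final class BufferSizing {
  private static final int MAX_BUFFER_SIZE = 1024 * 1024; // 1 MiB

  // Instead of falling back to 4096-byte chunks after the first buffer,
  // keep allocating up to 1 MiB, bounded by what remains to be written.
  static int nextBufferSize(int remainingBytes) {
    return Math.min(remainingBytes, MAX_BUFFER_SIZE);
  }
}
```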