Traefik doesn't reconnect to Jaeger when connection lost #6093
Comments
I can confirm the issue on Kubernetes, and it probably occurs on any other platform when a service name is used as the Jaeger host endpoint. To me it looks like the issue is caused by Traefik not recreating the Jaeger client, because it's a UDP connection (no failure detection), for example:
Not sure what we could do in this case, though.
The samplingServerURL is an HTTP endpoint.
jaegertracing/jaeger-client-go#403 I believe this is an issue with the
@Pehesi97 in this case, isn't If so, the service IP should not have changed, and a restarted pod should not have made any changes to the service IP. Traefik would never have connected directly to a pod IP, as pods in Kubernetes do not normally get a DNS record. There are exceptions, but this does not look like one. Is it possible that your pod did not pass the readiness checks and was not re-added as a service endpoint? Or is there another reason that your service IP would have changed?
@dtomcej This can happen because that template deploys a headless service for the agent. This means no proxying is done; the service merely provides a DNS convenience for round-robining across the pods it selects. I've submitted a PR to the jaeger-client-go library, jaegertracing/jaeger-client-go#520, that should resolve this issue. Would anyone mind having a look and possibly giving it a 👍 for more exposure?
FYI, if you upgrade your dependency now, you should see that this issue is resolved by the linked PR.
Closed by #7198
Do you want to request a feature or report a bug?
Report a bug.
What did you do?
Configured Traefik to output traces to a Jaeger instance running on the same Kubernetes cluster using the following arguments:
It worked correctly, until my Jaeger Agent pod restarted. Traefik didn't connect to the service again and therefore my traces weren't being written to Jaeger.
What did you expect to see?
Traefik reconnecting to Jaeger Agent when it was available again.
What did you see instead?
Traefik didn't reconnect to Jaeger Agent when it was available again.
Output of traefik version: (What version of Traefik are you using?)
What is your environment & configuration (arguments, toml, provider, platform, ...)?