Skip to content

Hardcoded zooKeeperNamespace in URL doesn't allow to use custom name for zooKeeperNamespace  #40805

@ada-nilova

Description

@ada-nilova

Apache Airflow Provider(s)

apache-hive

Versions of Apache Airflow Providers

apache-airflow-providers-apache-hive==8.1.1

Apache Airflow version

2.8.4

Operating System

Debian GNU/Linux 12

Deployment

Docker-Compose

Deployment details

No response

What happened

In #38651 to airflow/providers/apache/hive/hooks/hive.py was added hardcoded URL part with "zooKeeperNamespace=hiveserver2".
Unfortunately, "hiveserver2" is not the once namespace name that can be. In some Hadoop installations zooKeeperNamespace can have any custom name.

In the same change was added ssl using always and only when using HA. But in some Hadoop installations we can set using ssl, but not using HA. And now we cannot use one option separately from another.

What you think should happen instead

If we check HA checkbox, it should be better to have a field for custom zooKeeperNamespace name in a connection form, not hardcoded.

And it should be better to have a special checkbox for ssl setting, separately from a HA checkbox

How to reproduce

If we have another zooKeeperNamespace (not hiveserver2) we cannot use HiveOperator with it.

If we have Hive with SSL, but without Zookeeper we cannot use HiveOperator with it.

Anything else

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions