Troubleshooting using Kerberos with Talend Big Data
For information on how to use Kerberos with Talend Big Data, see:
- How to use Kerberos in Talend Studio with Big Data v5.x (part 1)
- How to use Kerberos in Talend Studio with Big Data v6.x
When the debugger is enabled, additional information is captured which can help diagnose issues. To enable the debugger:
- Navigate to the Run view of the Talend Studio.
- Click to open the Advanced settings tab. Select the Use specific JVM arguments check box.
- Click the New... button and add this argument:
-Dsun.security.krb5.debug=true
To collect debugging information, after enabling the debugger:
- Navigate to the Basic Run tab.
- Run the Job.
- Examine the collected log messages. In doing this, pay attention to the used principal and how the clien, that is to say, Talend Studio or your Hadoop cluster, is dealing with Kerberos from the cache.
Once the detailed error is identified, search for it in the errors below.
-
"Exception in thread "main" java.lang.ExceptionInInitializerError at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:274) at org.apache.hadoop.conf.Configuration.getClassByNameOrNull(Configuration.java:2147)
Possible Cause
You java driver does not support the kerbberos security.
Resolution
Get http://www.oracle.com/technetwork/java/javase/downloads/jce-7-download-432124.html and unzip it on your $JAVAHOME/jre/lib/security.
-
Caused by: java.lang.IllegalArgumentException: Illegal principal name user@BIGDATA.COM: org.apache.hadoop.security.authentication.util.KerberosName$NoMatchingRule: No rules applied to user@BIGDATA.COM
Possible realm conflict.
ResolutionCheck your /etc/krb5.conf . If there are multiple realms, try setting your realm as the default one.
-
GSSException: No valid credentials provided (Mechanism level: Connection refused: connect)
In krb5.ini, the KDC hostname is incorrect or the KDC daemon is not started on this server.
ResolutionCheck for the daemon status or ask to verify the right KDC.
-
GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt
User who executes the Job has no valid ticket in his/her cache.
ResolutionRun kinit where the driver will run (on localhost if from Talend Studio, on the server where the Talend JobServer lies if on remote).
-
java.io.IOException: java.lang.IllegalArgumentException: Server has invalid Kerberos principal: nn/sandbox.hadoop.com@EXAMPLE.COM; Host Details : local host is: "ServerName1/127.0.1.1";
The server name is not defined in /etc/hosts .
ResolutionCheck etc/hosts on the client machine and make sure you have the same hostname.
Or
Replace _HOST by the server's complete hostname.
-
java.io.IOException: Login failure for user1 from keytab C:/Users/user1.DOMAIN1/Documents/sko/user1.keytab Caused by: javax.security.auth.login.LoginException: Checksum failed
The principal hash does not correspond to the hash the keytab knows. It can happen when the principal was once deleted and then recreated in the KDC database.
ResolutionRenew the obsolete keytab.
-
java.io.IOException: Login failure for user1 from keytab C:/Users/user1.DOMAIN/Documents/sko/user1.keytab Caused by: KrbException: Client not found in Kerberos database (devil) - CLIENT_NOT_FOUND
The keytab correctly contains the user principal but KDC does not know this principal anymore.
ResolutionRenew the obsolete keytab.
-
java.io.IOException: Login failure for user1 from keytab C:/Users/user1.DOMAIN/Documents/sko/user1.keytab Caused by: javax.security.auth.login.LoginException: Unable to obtain password from user
The keytab used does not contain any credentials corresponding to the specified principal or the keytab is not readable for the current user.
ResolutionCheck the username specified in the Talend components, if okay, check the keytab privileges (read access) and validity.
-
kinit: Client not found in Kerberos database while getting initial credentials
The user has no principal in the KDC database.
ResolutionCreate the principal or use the right one (via kadmin or kadmin.local).
-
krb5_get_init_creds: unable to reach any KDC in realm EXAMPLE.COM, tried 1 KDC
The KDC server is configured to use only UDP or TCP and not both, as supposed by your krb5.conf .
ResolutionTry to force the protocol in the krb5.conf by adding a line:
kdc = tcp/<kdc_server_hostname>:88
-
org.apache.hadoop.hbase.exceptions.UnknownProtocolException: No registered coprocessor service found for name AuthenticationService in region hbase:meta,,1
The HBase server side configurations for the coprocessor security are missing.
ResolutionAdd the following to hbase-site.xml :
<property> <name>hbase.coprocessor.region.classes</name> <value>org.apache.hadoop.hbase.security.token.TokenProvider, org.apache.hadoop.hbase.security.access.SecureBulkLoadEndpoint, org.apache.hadoop.hbase.security.access.AccessController </value> </property> <property> <name>hbase.rpc.engine</name> <value>org.apache.hadoop.hbase.ipc.SecureRpcEngine</value> </property> <property> <name>hbase.coprocessor.master.classes</name> <value>org.apache.hadoop.hbase.security.access.AccessController</value> </property>
-
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.AccessControlException): KERBEROS authentication is not enabled. Available:[SIMPLE]
TTalend Studio is configured to run a Job targeting a kerberized cluster whereas the server is not configured to use Kerberos.
ResolutionRemove the Kerberos credentials and configure the Job to access the cluster using simple authentication (user-based).
-
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.AccessControlException): SIMPLE authentication is not enabled. Available:[TOKEN, KERBEROS]
Talend Studio tries to connect to a non-kerberized cluster whereas this is a kerberized environment.
ResolutionActivate the Kerberos credentials in the Job / Talend components.
-
org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS]
Due to a bug of Talend Studio for MapReduce Jobs, this error is happening when you activated Kerberos on a M/R Job but a username was previously set and Talend Studio still uses it to access the cluster without taking Kerberos credentials into consideration.
ResolutionDeactivate the Kerberos authentication, set the username to blank and reactivate the Kerberos authentication.
-
org.apache.hadoop.security.authentication.util.KerberosName$NoMatchingRule: No rules applied to hdfs@TALEND.EXAMPLE.COM
You face a cross-realm Kerberos environment and the mapping rules to translate principal from one realm to the Hadoop realm does not match the expected result.
ResolutionCheck that the mapping rules are specified in the core-site.xml embedded in hadoop-conf-kerberos.jar . If yes, you have to correct these rules to obtain the right mapping.
-
Server has invalid Kerberos principal: hdfs/talend-cdh5-nn1@TALEND.COM
The realm configured in krb5.conf or the realm configured in the Job does not match the server realm.
ResolutionCheck the Job configuration and the krb5.ini to make sure this is aligned with the target server realm.
-
[WARN ]: org.apache.hadoop.security.UserGroupInformation - PriviledgedActionException as:user@EXAMPLE.COM (auth:KERBEROS) cause:java.io.IOException: Couldn't setup connection for user@EXAMPLE.COM to hbase/talend-cdh5@EXAMPLE.COM
The server name is not defined in /etc/hosts .
ResolutionAdd FQDN to /etc/hosts .