Class DataToSketchUDAF
- java.lang.Object
-
- org.apache.hadoop.hive.ql.udf.generic.AbstractGenericUDAFResolver
-
- org.apache.datasketches.hive.theta.DataToSketchUDAF
-
- All Implemented Interfaces:
org.apache.hadoop.hive.ql.udf.generic.GenericUDAFResolver,org.apache.hadoop.hive.ql.udf.generic.GenericUDAFResolver2
public class DataToSketchUDAF extends org.apache.hadoop.hive.ql.udf.generic.AbstractGenericUDAFResolverNote Strings as raw data values are encoded as a UTF-16 VARCHAR prior to being submitted to the sketch. If the user requires a different encoding for cross-platform compatibility, it is recommended that these values be encoded prior to being submitted and then typed as a BINARY byte[].
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classDataToSketchUDAF.DataToSketchEvaluator
-
Constructor Summary
Constructors Constructor Description DataToSketchUDAF()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluatorgetEvaluator(org.apache.hadoop.hive.ql.udf.generic.GenericUDAFParameterInfo info)Performs argument number and type validation.
-
-
-
Method Detail
-
getEvaluator
public org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluator getEvaluator(org.apache.hadoop.hive.ql.udf.generic.GenericUDAFParameterInfo info) throws org.apache.hadoop.hive.ql.parse.SemanticExceptionPerforms argument number and type validation. DataToSketch expects to receive between one and four arguments.- The first (required) is the value to add to the sketch and must be a primitive.
- The second (optional) is the sketch size to use. This must be an integral value and must be constant.
- The third (optional) is the sampling probability and is a floating point value between 0.0 and 1.0. It must be a constant
- The fourth (optional) is an update seed. It must be an integral value and must be constant.
- Specified by:
getEvaluatorin interfaceorg.apache.hadoop.hive.ql.udf.generic.GenericUDAFResolver2- Overrides:
getEvaluatorin classorg.apache.hadoop.hive.ql.udf.generic.AbstractGenericUDAFResolver- Parameters:
info- Parameter info to validate- Returns:
- The GenericUDAFEvaluator that should be used to calculate the function.
- Throws:
org.apache.hadoop.hive.ql.parse.SemanticException- See Also:
#getEvaluator(org.apache.hadoop.hive.ql.udf.generic.GenericUDAFParameterInfo)
-
-