- Mountain View CA, US Pankaj RASTOGI - Fremont CA, US Sumanth VENKATASUBBAIAH - Mountain View CA, US Qingbo HU - Burlingame CA, US Karthik PRAKASH - Milpitas CA, US Nicholas Jeffrey HOH - Mountain View CA, US Frank WISNIEWSKI - San Francisco CA, US Abhishek JAIN - Mountain View CA, US Caio Vinicius SOARES - Redwood City CA, US Yuwen WU - Mountain View CA, US
International Classification:
G06F 16/23 G06N 20/00 G06N 5/02 G06F 9/54
Abstract:
Certain aspects of the present disclosure provide techniques for operation of a feature management platform. A feature management platform is an end-to-end platform developed to manage the full lifecycle of data features. For example, to create a stateful feature, the feature management platform can receive a processing artifact from a computing device. The processing artifact defines the stateful feature, including the data source to retrieve event data from, when to retrieve the event data, the type of transform to apply, etc. Based on the processing artifact, the feature management system generates a processing job (e.g., the API defines a pipeline), which when initiated generates a vector that encapsulates the stateful feature. The vector is transmitted to the computing device that locally hosts a model, which generates a prediction that is transmitted to the feature management platform. Subsequently, the predication and stateful feature can be transmitted to other computing devices.
- Mountain View CA, US Pankaj RASTOGI - Fremont CA, US Sumanth VENKATASUBBAIAH - Mountain View CA, US Qingbo HU - Foster City CA, US Karthik PRAKASH - Milpitas CA, US Nicholas Jeffrey HOH - Sunnyvale CA, US Frank WISNIEWSKI - San Francisco CA, US Abhishek JAIN - Mountain View CA, US Caio Vinicius SOARES - Redwood City CA, US Yuwen Ellen WU - Mountain View CA, US
International Classification:
G06F 16/23 G06F 9/54 G06N 5/02 G06N 20/00
Abstract:
Certain aspects of the present disclosure provide techniques for operation of a feature management platform. A feature management platform is an end-to-end platform developed to manage the full lifecycle of data features. For example, to create a stateful feature, the feature management platform can receive a processing artifact from a computing device. The processing artifact defines the stateful feature, including the data source to retrieve event data from, when to retrieve the event data, the type of transform to apply, etc. Based on the processing artifact, the feature management system generates a processing job (e.g., the API defines a pipeline), which when initiated generates a vector that encapsulates the stateful feature. The vector is transmitted to the computing device that locally hosts a model, which generates a prediction that is transmitted to the feature management platform. Subsequently, the predication and stateful feature can be transmitted to other computing devices.
- Mountain View CA, US Abhishek JAIN - Mountain View CA, US Caio Vinicius SOARES - Redwood City CA, US Tristan Cooper BAKER - San Diego CA, US Joseph Brian CESSNA - San Diego CA, US
International Classification:
G06N 20/00
Abstract:
Certain aspects of the present disclosure provide techniques for operation of a feature management platform. A feature management platform is an end-to-end platform developed to manage the full lifecycle of data features. For example, to create a feature the feature management platform can receive a processing artifact (e.g., a configuration file and code fragment) from a computing device. The processing artifact defines the feature, including the data source to retrieve event data from, when to retrieve the event data, the type of transform to apply, etc. Based on the processing artifact, the feature management system generates a processing job, which when initiated generates a vector that encapsulates the feature data. The vector is transmitted to the computing device that locally hosts a model, which generates a prediction. The prediction is transmitted to the feature management platform and can be transmitted to other computing devices, upon request.
Framework For Processing Machine Learning Model Metrics
- Mountain View CA, US Sumanth Venkatasubbaiah - San Jose CA, US Caio Vinicius Soares - Mountain View CA, US
Assignee:
Intuit Inc. - Mountain View CA
International Classification:
G06N 5/04 G06N 20/00 G06F 11/30 G06F 9/54
Abstract:
A method including extracting, from an input, supported data. The input includes outputs from machine learning models in different formats. The supported data includes a subset of the input after data normalization. The method also includes inferring, from the supported data, data types to be used with respect to generating metrics for the machine learning models. The method also includes generating, from the supported data and using the data types, a relational event including the supported data. The relational event further includes a first data structure object including the types and having a first data structure different than the different formats. The method also includes calculating, using the supported data in the first data structure, the metrics for the machine learning models. The method also includes generating, from the relational event, a monitoring event. The monitoring event includes a second data structure object segmented into data buckets storing the metrics.
Method And System For Generating Synthetic Data Using A Regression Model While Preserving Statistical Properties Of Underlying Data
- Mountain View CA, US Malhar Siddhesh Jere - Mountain View CA, US Sumanth Venkatasubbaiah - San Jose CA, US Caio Vinicius Soares - Mountain View CA, US Sricharan Kallur Palli Kumar - Mountain View CA, US
Assignee:
Intuit Inc. - Mountain View CA
International Classification:
G06N 3/04 G06N 3/08 G06F 7/58
Abstract:
A method for generating a synthetic dataset involves generating discretized synthetic data based on driving a model of a cumulative distribution function (CDF) with random numbers. The CDF is based on a source dataset. The method further includes generating the synthetic dataset from the discretized synthetic data by selecting, for inclusion into the synthetic dataset, values from a multitude of entries of the source dataset, based on the discretized synthetic data, and providing the synthetic dataset to a downstream application that is configured to operate on the source dataset.
Method For Personalized Context-Aware, And Privacy Preserving Real-Time Brokerage For Advertising
- Stuttgart, DE Juergen Heit - Sunnyvale CA, US Caio Soares - Redwood City CA, US Jo-Anne Ting - San Francisco CA, US
Assignee:
Robert Bosch GmbH - Stuttgart
International Classification:
G06Q 30/02
US Classification:
705 1458
Abstract:
A real-time and privacy-preserving method for delivering personalized information to a user within a specific geographical location is provided. The method comprises the steps of: storing information specific to a user in a database maintained by a centralized brokerage service; transmitting a user initiated request to the centralized brokerage service to provide a listing of products and services offered by retailers or third parties located in the approximate current geographical vicinity of the user; utilizing the stored information of the user to generate a personalized listing of products and services offered within the approximate current geographical vicinity of the user; and sending the generated personalized listing to the user.
Auburn University Aug 2009 - Aug 2010
Researcher
Auburn University Aug 2004 - May 2009
Graduate Teaching Assistant
McKesson Provider Technologies Jun 2007 - Aug 2007
Software Engineering Intern
Auburn University Aug 2005 - Aug 2006
Graduate Research Assistant
Education:
Auburn University 2004 - 2010
Ph.D., Computer Science and Software Engineering-GPA: 4.0/4.0
-Research Interests: Artificial Intelligence, Machine Learning, Optimization, Predictive Modeling, Data Mining
-Dissertation: Improving Prediction Accuracy Using Class-specific Ensemble Feature Selection
Auburn University 2004 - 2009
M.S., Computer Science and Software Engineering-GPA: 4.0/4.0
-Research Interests: Artificial Intelligence, Machine Learning, Optimization, Predictive Modeling, Data Mining
-Thesis: A Class-specific Ensemble Feature Selection Approach for Classification Problems
Berry College 2001 - 2004
BS, Computer Science, Mathematics
Skills:
Machine Learning Data Mining Artificial Intelligence Computer Science Java Algorithms R Python Distributed Systems C Neural Networks Statistics Optimization Sql Latex Svm Hadoop Decision Trees Evolutionary Computation Mpi Datamining Logistic Regression Regression Testing Classification Eda Stochastic Optimization Feature Extraction Feature Selection Dynamic Modeling Design of Experiments Experimental Design
Interests:
Artificial Intelligence Quantitative Research Machine Learning Quantitative Development Data Mining Pattern Recognition Software Engineering
Languages:
English Portuguese Spanish
Certifications:
Ccdh: Cloudera Certified Developer For Apache Hadoop Cloudera University