/ Posts / dotData Overview: Enterprise AI Platform
by N/A - Dominick Amalraj
on January 11, 2021
Machine learning objectives are transforming the way organizations are conducting business. With the ability to understand what the future holds more clearly and insights that can change the way an enterprise operates, it is no shock there is a desire to implement these solutions. Machine learning use cases are difficult to deliver on due to the skillset needed and the time-consuming nature associated with these projects. Automated machine learning has opened new doors for both data scientists and non-data scientists to more efficiently take on these projects. Once a dataset is prepared, automated machine learning can implement top-notch models at the fraction of the time and allow users to deploy them through REST API or other means.
But even though auto-ml platforms are saving time and efficiency in the model building phase, there is still a large amount of work that goes into the various other phases in the data science process. The 80/20 rule in data science says 80% of the time spent on a data science project is on finding, cleaning, and organizing data. This leaves only 20% of the time to actually build out predictive models. Within the data preparation stage, data science teams conduct various ETL steps to create their data table, clean the data, and conduct feature engineering. dotData, an Enterprise AI platform, has created a solution that not only automates model building but the entire data science process.
The dotData platform conducts and automates all the necessary steps in the machine learning process as the image above depicts. The steps automated within this platform include the ETL process, feature engineering, model building, and production of final results. This article will cover the main functionalities of the dotData platform.
1. Automated ETL Process
In any machine learning project, the ETL process is a vital piece. This includes bringing all the necessary tables from various data sources, cleaning the data, and creating a table which can be used for model training. dotData allows users to upload data tables into the platform from a plethora of sources. Once the data is ingested, dotData can automatically clean, analysis, and prepare multi-dimensional tables ready for model building.
The user is in charge of selecting a feature to predict such as sales or churn. From there, the dotData platform will be able to join the necessary tables needed based on the statistical relationship to the target field and conduct the necessary ETL steps to form a training dataset. This removes the need for a user to evaluate which tables are necessary for model building and any ETL steps needed to configure the final table. On top of that, dotData will automatically cleanse the schema and final data table used for model building.
2. AI – Powered Feature Engineering
Another crucial aspect in machine learning is the feature engineering. There is quite a bit that can be done in this step including: computing new features, conducting any necessary transformations, removing redundant or insignificant features, and more. This can be a very complex and lengthy process, but with dotData this would only take a few hours. dotData presents the features and most relevant findings with thorough visual explanations. This gives users a better understanding of the main drivers that impact a target field.
3. Auto-ML and Production Capabilities
dotData has plenty of innovative machine learning techniques when it comes to their automated machine learning. This includes models from XGBoost, LightGMB, TensorFlow, and PyTorch. As with many automated machine learning tools, the platform will automatically tune and reconfigure models to achieve the most accurate results. From the thorough feature engineering conducted in the dotData platform, the model's accuracy sees significant improvement and insights are more clearly illustrated. After models are made, dotData has a GUI that allows users to see insights from the models, make predictions, and gain the necessary information to make better business decisions. With its REST API, predictions made in dotData can be accessible to a number of BI platforms and sources.
dotData is focused on limiting the effort and time associated in the machine learning process. With the reduced time to value from automating the entire data science process, users will be able to discover more from their models and dive deeper into them to better prepare for the future. Additionally, with the reduced time per project, data science teams are able to tackle more use cases than ever before. If you have any questions or would like to schedule a demo of dotData please reach out to info@pomerolpartners.com for more information
by Owen Bott on March 11, 2021
Pomerol Appoints New Partners - Scott Duthie & Goncalo Pereira
Viewby Owen Bott on October 22, 2020
Utilizing Qlik to track our progress against COVID19.
Viewby N/A - Dominick Amalraj on October 2, 2020
Learn how DataRobot can accelerate every aspect in the machine learning process
Viewby Owen Bott on October 1, 2020
What's New to Qlik Sense with the September 2020 Release
Viewby John Fitzgerald on August 31, 2020
A fire at a neighbor's house reminded me that the first attempt at solving a problem might not always be the right approach.
Viewby N/A - Tyler Robinson on August 18, 2020
How to take your Fantasy Football draft to the next level.
Viewby Scott Duthie on August 6, 2020
Data driven decision making starts at the branch – a case study for credit unions.
Viewby N/A - Dominick Amalraj on May 26, 2020
Elevate your organization from machine learning capable to machine learning driven.
Viewby N/A - Tyler Robinson on May 8, 2020
The Necessity for Clean Data - A Sample Use Case
Viewby Scott Duthie on May 6, 2020
4 Ways to extend Qlik NPrinting to get more value out of it.
Viewby Owen Bott on April 27, 2020
Sneak Peak into all of the new features in the Qlik Sense April 2020 Release
Viewby N/A - Brian McManamy on April 5, 2020
Are you getting the most out of your Qlik Sense monitoring tools?
Viewby N/A - Tyler Robinson on March 21, 2020
How can you use data to solve your most critical problems?
Viewby N/A - Dominick Amalraj on March 19, 2020
Learn more about how you can get the most out of your machine learning projects during unpredictable times
Viewby Wendell Truax on March 16, 2020
Plan your Qlik Sense upgrades more reliably with our extension inventory application.
Viewby Scott Duthie on March 1, 2020
How do you transform ‘Consumers’ of analytics to ‘Contributors’? You provide a tool for them to seamlessly share and communicate their questions, insights and ideas.
Viewby Scott Duthie on February 18, 2020
Explore the many ways that NodeGraph can help you track and manage your Qlik metadata.
Viewby Scott Duthie on January 8, 2020
Pomerol joins forces with non-profit to increase sex trafficking awareness through data analytics.
Viewby on January 8, 2020
Learn how Trifacta can simplify and expedite your data transformations for analysis.
Viewby on January 8, 2020
Check out the latest and greatest in the November 2019 Qlik Sense update.
Viewby N/A - Mike Mahoney on November 21, 2019
Learn about Motio and how version control is crucial for your Qlik deployment.
Viewby N/A - Mike Mahoney on November 21, 2019
Vizlib, a industry leading developer of Qlik Sense visualization extensions, has joined the Pomerol team.
Viewby on November 21, 2019
Pomerol Partners and Sense Excel collaborate to “turbo-charge” reporting and analysis for organizations
Viewby N/A - Mike Mahoney on November 21, 2019
What are the hot BI topics your organization should be thinking about?
Viewby N/A - Mike Mahoney on November 21, 2019
Couchbase and Pomerol Partners Drive Customer Success with Faster Time to Value
Viewby on November 21, 2019
We have partnered with StreamSets to help modernize your data integration efforts.
Viewby on November 21, 2019
Use these tips to build a self-service analytics platform for your organization.
Viewby on November 21, 2019
Qlik recently acquired CrunchBot and Crunch Data, an experienced AI and solution development team.
Viewby on November 21, 2019
Check out the new updates and functionalities of the Qlik Sense April 2019 Release.
Viewby Scott Duthie on November 21, 2019
Pomerol Partners and DataRobot to collaborate on automated machine learning within predictive analytics
Viewby on November 21, 2019
7 major roadblocks of machine learning projects and how to overcome them.
Viewby on November 21, 2019
Check out the new updates and functionalities for the Qlik Sense February 2019 Release.
Viewby on November 21, 2019
Check out the best updates and functionalities for the June 2019 Qlik Sense Release.
Viewby N/A - Mike Mahoney on October 16, 2019
Check out how the data wrangling tool, Trifacta, can help advance your machine learning needs.
Viewby Kanon Cozad on August 1, 2019
Learn about Big Squid and how Pomerol can help you implement it.
Viewby John Fitzgerald on December 25, 2016
Leverage K4 Analytics for advanced planning, budgeting, and forecasting from inside your Qlik apps
View