Teradata Machine Learning Challenge
Overview
Data preparation is a very important step in Machine Learning, but it can often be tedious and time-consuming. In fact, 80% of the time spent developing ML pipelines is typically dedicated to data preparation. Teradata simplifies this process with its powerful in-database analytic functions.
In this challenge, which should take less than 30 minutes to complete, you'll build a data preparation pipeline, but with a twist — we've removed the function names!
By solving this challenge, you'll gain hands-on experience with Teradata's ClearScape Analytics, by using a free account, and learn how it can make your data preparation tasks easier and more efficient.
We invite you to complete this challenge during AWS re:Invent for an opportunity to be entered into a free draw with a chance to win one of three awesome prizes.
Questions? Please feel free to email the Teradata Developer Relations team.
Entry submission timeline and prizes
Entries may be submitted starting on Monday, December 2, 2024, at 8am Pacific Standard Time (PST). The deadline to enter is Sunday, December 8, 2024, at 11:59 p.m. Pacific Standard Time (PST). Late entries will not be accepted. You also must get your badge scanned at the Teradata booth 1887 at AWS re:Invent to be eligible to enter.
Three prize winners from those who successfully completed the Machine Learning Challenge will be selected via electronic random drawing on Wednesday, December 11, 2024 and notified via email.
The first winner will receive a Star Wars™ Millennium Falcon™ LEGO® kit (Retail Value $849) and the second and third winners will each receive a Star Wars™ R2-D2™ LEGO® kit (Retail Value $99).
Machine Learning Challenge steps
- Access a free Teradata environment: Sign up for a free ClearScape Analytics Experience account to execute your functions against the dataset and test your solution to prep the data.
- In ClearScape Analytics Experience, create an environment named ml-challenge-reinvent. Click on "Run demos" in this environment. ClearScape Analytics Experience's Jupyter environment will open.
- Create a folder named "ml-challenge-reinvent" inside the "UseCases" folder in the Jupyter notebook environment.
- Open the ml-challenge-reinvent folder, click on "Open from URL" under the "File" menu, and paste the following URL in the dialogue: raw.githubusercontent.com/Teradata/aws-reinvent-ml-challenge/refs/heads/main/Customer_Churn_ML_Challenge.ipynb. This will copy the challenge Jupyter notebook to your environment.
- Explore the business scenario and dataset: The business scenario, dataset, and prefilled steps are in this Jupyter notebook.
- Complete the pipeline: The notebook is incomplete - your task is to identify the missing elements. Specifically, you'll need to:
- Identify and import the necessary libraries.
- Fill in the missing ClearScape Analytics functions for data preparation.
- Add any additional data preprocessing steps required.
You'll find hints in the notebook's markdown cells to help you along the way. Refer to the Teradata documentation for guidance on the missing functions and steps.
- Submit your solution: Once you've completed the notebook, submit your solution by completing the short entry form located on this page, and within the form, let us know the email address you used to create your ClearScape Analytics Experience account.
Questions? Please feel free to email the Teradata Developer Relations team.
Rules
- Eligibility: You must successfully complete the Machine Learning Challenge (see Steps above) to be entered into the random prize draw. This Machine Learning Challenge is free to enter and only open to registered AWS re:Invent attendees who stopped by the Teradata booth 1887 in-person in Las Vegas, Nevada, and had their AWS re:Invent badge scanned by a Teradata staff member. You must be over the age of 18. By entering this Machine Learning Challenge, you confirm you are eligible. This Teradata Machine Learning Challenge is not open to Teradata associates.
- A maximum of one entry per individual is permitted. Entry is free of charge. Entrants are not required to provide consideration of any kind.
- Teradata will attempt to contact winners by email up to two times. If a winner does not respond to the emails by 11:59pm Pacific Daylight Time (PST) Friday, December 13, 2024, they will lose their right to the prize, and Teradata reserves the right to choose and notify a new winner. By entering this prize draw, you are confirming that you are permitted to do so by your employer and, if selected as a winner, through the random draw, to receive the Teradata Machine Learning Challenge prize. Winners are chosen entirely at random by computer and regardless of any skill or knowledge you may have demonstrated in the Challenge. You are also confirming that your employer is not currently considering a Teradata sales proposal.
- You will not include any content that includes any rights of any kind owned by a third party or violates any laws.
- You confirm that you are not a federal, state or government employee.
- Prizes cannot be converted to cash and are non-exchangeable and non-transferable.
- You are responsible for making any required personal tax declaration.
- Teradata will make any required U.S. Internal Revenue Service filings regarding winners and prize values.
- We reserve the right to cancel this challenge or substitute prizes with another prize of equal or higher value if circumstances beyond our control make it necessary to do so.
- The decision of Teradata regarding any aspect of the Machine Learning Challenge and the prize draw is final and binding and no correspondence will be entered into about it.
- This is a Nevada-based prize draw, subject to the laws and venue of the State of Nevada.