java aws linux spark informatica mysql git pyspark tableau agile methodology jira machine learning shell scripting data analysis data warehousing data management microsoft azure confluence data cleansing data modeling scrum data integration teradata machine learning algorithms