Why, you ask. Ideally, players should make more runs the more the number of balls they've played. And using this intuition, the relationship seems pretty linear. But is it?
Create a scatter plot for no. of runs against no. of balls for each player. You need to write a function plot_runs_by_balls() which creates a scatter plot of balls played on the X axis, and their respective runs scored on the Y axis.
This function takes in no parameters
This function returns nothing
Remember, what we want is the count of balls played (aggregation) and sum of runs (aggregation), per match per player (grouping). Since there is one grouping, and two different aggregations, you might want to "merge" them into one dataframe.