As a data analyst, you will assess continuous data sources for their relevance to specific research questions throughout your career.
In your previous coursework, you have performed data cleaning and exploratory data analysis on your data. You have seen basic trends and patterns and now can start building more sophisticated statistical models. In this course, you will use and explore both multiple regression and logistic regression models and their assumptions.
For this task, you will select one of the Data Sets and Associated Data Dictionaries from the following link:
Data Sets and Associated Data Dictionaries
You will then review the data dictionary related to the raw data file you have chosen, and prepare the data set file for multiple regression modeling. The organizations connected with the given data sets for this task seek to analyze their operations and have collected variables of possible use to support decision-making processes. You will analyze your chosen data set using multiple regression modeling, create visualizations, and deliver the results of your analysis. It is recommended that you use the cleaned data set from your previous course.
Note: The link to the data files can also be found below in the web links section. If you have trouble accessing the link, copy and paste the link directly into your web browser.
Your submission must be your original work. No more than a combined total of 30% of the submission and no more than a 10% match to any one individual source can be directly quoted or closely paraphrased from sources, even if cited correctly. The originality report that is provided when you submit your task can be used as a guide.
You must use the rubric to direct the creation of your submission because it provides detailed criteria that will be used to evaluate your work. Each requirement below may be evaluated by more than one rubric aspect. The rubric aspect titles may contain hyperlinks to relevant portions of the course.
Tasks may not be submitted as cloud links, such as links to Google Docs, Google Slides, OneDrive, etc., unless specified in the task requirements. All other submissions must be file types that are uploaded and submitted as attachments (e.g., .docx, .pdf, .ppt).
Part I: Research Question
A. Describe the purpose of this data analysis by doing the following:
1. Summarize one research question that is relevant to a real-world organizational situation captured in the data set you have selected and that you will answer using multiple regression.
2. Define the objectives or goals of the data analysis. Ensure that your objectives or goals are reasonable within the scope of the data dictionary and are represented in the available data.
Part II: Method Justification
B. Describe multiple regression methods by doing the following:
1. Summarize the assumptions of a multiple regression model.
2. Describe the benefits of using the tool(s) you have chosen (i.e., Python, R, or both) in support of various phases of the analysis.
3. Explain why multiple regression is an appropriate technique to analyze the research question summarized in Part I.
Part III: Data Preparation
C. Summarize the data preparation process for multiple regression analysis by doing the following:
1. Describe your data preparation goals and the data manipulations that will be used to achieve the goals.
2. Discuss the summary statistics, including the target variable and all predictor variables that you will need to gather from the data set to answer the research question.
3. Explain the steps used to prepare the data for the analysis, including the annotated code.
4. Generate univariate and bivariate visualizations of the distributions of variables in the cleaned data set. Include the target variable in your bivariate visualizations.
5. Provide a copy of the prepared data set.
Part IV: Model Comparison and Analysis
D. Compare an initial and a reduced multiple regression model by doing the following:
1. Construct an initial multiple regression model from all predictors that were identified in Part C2.
2. Justify a statistically based variable selection procedure and a model evaluation metric to reduce the initial model in a way that aligns with the research question.
3. Provide a reduced multiple regression model that includes both categorical and continuous variables.
Note: The output should include a screenshot of each model.
E. Analyze the data set using your reduced multiple regression model by doing the following:
1. Explain your data analysis process by comparing the initial and reduced multiple regression models, including the following elements:
• the logic of the variable selection technique
• the model evaluation metric
• a residual plot
2. Provide the output and any calculations of the analysis you performed, including the model’s residual error.
Note: The output should include the predictions from the refined model you used to perform the analysis.
3. Provide the code used to support the implementation of the multiple regression models.
Part V: Data Summary and Implications
F. Summarize your findings and assumptions by doing the following:
1. Discuss the results of your data analysis, including the following elements:
• a regression equation for the reduced model
• an interpretation of coefficients of the statistically significant variables of the model
• the statistical and practical significance of the model
• the limitations of the data analysis
2. Recommend a course of action based on your results.
Part VI: Demonstration
G. Provide a Panopto video recording that includes all of the following elements:
• a demonstration of the functionality of the code used for the analysis
• an identification of the version of the programming environment
• a comparison of the two multiple regression models you used in your analysis
• an interpretation of the coefficients.
Note: The audiovisual recording should feature you visibly presenting the material (i.e., not in voiceover or embedded video) and should simultaneously capture both you and your multimedia presentation.
Note: For instructions on how to access and use Panopto, use the “Panopto How-To Videos” web link provided below. To access Panopto’s website, navigate to the web link titled “Panopto Access,” and then choose to log in using the “WGU” option. If prompted, log in using your WGU student portal credentials, and then it will forward you to Panopto’s website.
To submit your recording, upload it to the Panopto drop box titled “Multiple Regression Modeling – NBM2 | D208.” Once the recording has been uploaded and processed in Panopto’s system, retrieve the URL of the recording from Panopto and copy and paste it into the Links option. Upload the remaining task requirements using the Attachments option.
H. List the web sources used to acquire data or segments of third-party code to support the application. Ensure the web sources are reliable.
I. Acknowledge sources, using in-text citations and references, for content that is quoted, paraphrased, or summarized.
J. Demonstrate professional communication in the content and presentation of your submission.
Why Work with Us
Top Quality and Well-Researched Papers
We always make sure that writers follow all your instructions precisely. You can choose your academic level: high school, college/university or professional, and we will assign a writer who has a respective degree.
Professional and Experienced Academic Writers
We have a team of professional writers with experience in academic and business writing. Many are native speakers and able to perform any task for which you need help.
Free Unlimited Revisions
If you think we missed something, send your order for a free revision. You have 10 days to submit the order for review after you have received the final document. You can do this yourself after logging into your personal account or by contacting our support.
Prompt Delivery and 100% Money-Back-Guarantee
All papers are always delivered on time. In case we need more time to master your paper, we may contact you regarding the deadline extension. In case you cannot provide us with more time, a 100% refund is guaranteed.
Original & Confidential
We use several writing tools checks to ensure that all documents you receive are free from plagiarism. Our editors carefully review all quotations in the text. We also promise maximum confidentiality in all of our services.
24/7 Customer Support
Our support agents are available 24 hours a day 7 days a week and committed to providing you with the best customer experience. Get in touch whenever you need any assistance.
Try it now!
How it works?
Follow these simple steps to get your paper done
Place your order
Fill in the order form and provide all details of your assignment.
Proceed with the payment
Choose the payment system that suits you most.
Receive the final file
Once your paper is ready, we will email it to you.
No need to work on your paper at night. Sleep tight, we will cover your back. We offer all kinds of writing services.
No matter what kind of academic paper you need and how urgent you need it, you are welcome to choose your academic level and the type of your paper at an affordable price. We take care of all your paper needs and give a 24/7 customer care support system.
Admission Essays & Business Writing Help
An admission essay is an essay or other written statement by a candidate, often a potential student enrolling in a college, university, or graduate school. You can be rest assurred that through our service we will write the best admission essay for you.
Our academic writers and editors make the necessary changes to your paper so that it is polished. We also format your document by correctly quoting the sources and creating reference lists in the formats APA, Harvard, MLA, Chicago / Turabian.
If you think your paper could be improved, you can request a review. In this case, your paper will be checked by the writer or assigned to an editor. You can use this option as many times as you see fit. This is free because we want you to be completely satisfied with the service offered.