Have you ever just looked around where you are and thought “How the heck did I get HERE?” If I am being honest, it has happened a fair number of times for me. Normally when this happens, I stand in amazement for a few minutes and just let myself soak it in.
One such moment happened at PASS Summit 2018. It was my pleasure to have been invited to live-blog from the PASS Summit Blogger’s Table for the 2018 Summit Keynotes. Surrounded by some of the most amazing people in the community, I nervously sat out to share my experiences in real time. For reference, those blogs are listed here:
The Official PASS Summit Blogger program is made up of some of the community’s most influential and knowledgeable experts across architecture, data management, and analytics.
This year’s team of bloggers will share the latest event announcements, along with their past experiences and tips and tricks in the lead up to PASS Summit 2019.
What does that mean?
There’s a lot of fancy words there. So what does this mean for YOU? Official Bloggers not only get the opportunity to live-blog the keynote events, but we also get exclusive access with the PASS Executive Committee, industry leaders, exhibitors, and sponsors.
Through this program, I will be on the inside and will be sharing:
stories and highlights from PASS Summit 2018;
tips and tricks about making Summit a great experience,
info about PASS Summit events and announcements;
and live-blogging events and sessions at PASS Summit 2019!
This is such an exciting opportunity and I am looking forward to sharing everything with you! I can tell you now, there will be a lot of exciting information in the months to come. And as always, along the way I will do my best to entertain and enlighten.
Other Official PASS Summit Bloggers
The coolest thing about bloggers is that each one of them has a unique understanding to share. I highly suggest you follow me and my fellow Official Pass Summit Bloggers listed below.
A few years ago I accompanied my husband to Atlanta for SQL Saturday. He was speaking and had told me what a wonderful event it was. Not ever wanting to pass up a chance to learn, network with community, and be of assistance to my great friends there, I jumped at the chance to go! I was not speaking yet, so I was purely an attendee.
I dreamed of the day when I would be able to present. I have always admired those who were brave enough to stand in front of strangers and share knowledge and experience. I was never sure I had the courage, or the experiences to share. What I have come to realize is that EVERYONE has a story to share!
Go Big or Go Home
To say that SQL Saturday Atlanta is a big event is a total understatement. There are always tons of people there! Not only that, the planning team is a large group who work seamlessly and tirelessly to pull of an awesome event! The event this year is held on May 18, 2019 at Gwinnett Technical College, Alpharetta, Georgia.
Data ethics is something near and dear to my heart. In this session we discuss how data collection and usage is all around us and explore how it is used. Is what we do with the data ethical? Are we collecting and processing biased data? We will take a look at ethical data collection and usage, what it means to your models and forecasts, and how being aware of possible biases can impact your bottom line. You might be training on extremely biased data and not know it, skewing your forecasts in the wrong direction.
I hope you will join me. My goal is to make learning fun and interactive. Fortunately, I am one of the first sessions, so grab some coffee and a snack and join me in Room 226!
More Than Learning
If you have never been to a SQL Saturday, one of the most important things to remember is that it is more than just about learning new skills. SQL Saturdays are a fantastic place to meet new friends, vendors who can make your job easier, and quite possibly meet your new employee or coworker! If you are in search of a new job, networking with vendors and others in the community will boost your reach.
Take a moment to speak with fellow attendees and speakers. We often get so caught up in learning new technical skills that we forget to make time to connect with people. Don’t be shy! We are all in this together, we all want to connect and share. Don’t miss out! Register for SQL Saturday Atlanta today, or a SQL Saturday near you!
Thank you to Idera for sponsoring me as an Idera Ace!
The number one question I get asked is if the capstone was difficult. The short answer is “yes.” The capstone is designed to test the knowledge and skills gained from the classes in the course, as well as expand your horizons as to how you would deal with “real-world” projects. The next question I get is “Do you think I can pass it?” Absolutely… if you give it your all.
The capstone I completed was divided into three parts. * data analytics * building and testing a machine learning model * write a professional report
The Sum of Three Parts
Part 1, Analyzing Data
The capstone really put me to work. Reviewing and analyzing data is very easy and straight forward for me. I really enjoy looking between the lines and finding the patterns that emerge. The capstone allows the student use of any analytics program they desire. For this portion, Excel was my choice, and it proved to be a wise one. Part 1 took mere minutes for me to complete. It was the second part of the capstone where the majority of my time and effort was spent.
Part 2, Machine Learning
In order to create a successful machine learning model, you will need to be proficient in your Google-Fu. Building the machine learning model took more knowledge and experience than I had gained in the course work alone. A large amount of my time was spent exploring various approaches and algorithms. Many hours were spent researching algorithms and trying to figure out the best ways to go about training the unruly model.
I was frustrated, tired, and admittedly ready to give up. REALLY AND TRULY. A thought occurred: “This is why they call it ‘Data Science’. I am sitting here trying to find answers along an untrodden path.” In that moment, I imagined this is what it feels like to be a true scientist. Just as in physical science, data science requires time, patience, research, trial and error.
Not being one to give in, I persisted until the best combination of algorithms was found. It was a profound relief to test the model against my data and to see it be successful.
So many people in the community helped guide me in the direction to find answers. Thank you to all who wrote blogs, tweets, whitepapers, and produced videos. Sharing your unique views and understanding with others makes us a stronger community.
Part 3, Professional Report
Armed with my data, my experiences, and my successful machine learning model, the final step in the capstone is to put it all together in a written report! Holy goodness. This was much more difficult than I had expected it to be. Your grade for the third part is dependent on other students and their assessment of your report. Yet another real-life experience to get you ready for a career as a Data Scientist!
The report took a few days to complete; the last day I worked a solid 30 hours straight on it. I could not sleep; was excited, terrified, stressed, and also just REALLY ready to get this completed. Once my report was submitted, I began reading the other students’ reports as assigned. The reports were outstanding. Those students brought to light information that had not shown itself in my project. Each report was vastly different; the story the data told the other three varied from each other and mine as well. How interesting that we all had the same data sets and somehow all four of us presented completely different stories!
Undoubtedly, the classes in the MPP Data Science course taught me a number of valuable skills while also having the unintended consequence of teaching me some things about…well…me.
When faced with something new and extremely difficult, I learned that I have the ability to rise up and learn and be successful. That the willingness to learn something new can set you apart from others.
At the end of the capstone, while reviewing other students’ reports, I saw opportunities for more learning as their approaches to the same material differed from mine.
Finally, I learned that when we work together for a common goal, we are stronger and smarter together.
You have undoubtedly heard that Data Science is one of the fastest growing fields in the data industry and one of the best jobs in America . While many people are interested in a career in data science, they are afraid it might take more than they have to offer. I was one of these people. I was afraid that I didn’t have the knowledge or mental (or mathematical) aptitude needed for such a career. Being the unwavering person I am, I set a goal to learn more and then went on a search for information. I found the Microsoft MPP in Data Science program and thought “Well, I can at least give it a try.”
*Let me pause here and applaud Microsoft for partnering with EdX.org to assemble and bring in this training… and making it available to anyone and everyone for FREE. You can take and complete these classes for free. The only payment needed is if you decide to complete the classes for verified certificates (needed to complete the MPP Certification).
What it takes
Are you interested in studying data science? Ask yourself these questions:
Do you have an interest in exploring abstract ideas?
Are you a curious person?
Do you feel comfortable seeking for answers in unique ways?
Do you love exploring with new programs and technology?
Are you interested in finding the story within the story?
Are you good at finding patterns where there seem to be only random ideals and images?
Are you interested in working with data?
What is a Data Scientist?
If you answered yes to the above questions, you just might be the next great Data Scientist! Let’s break down what a Data Scientist does. The role of data scientist is a unique one as it requires an ability to think on your feet, think outside the box, be creative with technology, and be somewhat of an entrepreneur. Data science walks the fine line between technology and creative story telling. A data scientist is one who knows how to use various means to pull narratives from data to create a great story. You see, data is not merely a static table of letters and numbers. No, it is much more than just digits in a row. Data is a living, breathing, ever-evolving collection of information that is searching for a way to tell its’ story. Data scientists are curious, technically equipped story-tellers exploring the data landscape for the next great story to share. Sound interesting? If so, keep reading!
Data Science Tools
On my journey to becoming a Microsoft MPP in Data Science, I started where we all start… at the beginning. The very first class in the MPP Course is Introduction to Data Science. This is your typical intro class. It is easy, but very important. This will guide you through what to expect, how to navigate the classes, as well as provide an over-view of the basic concepts and principles on which data science is based.
There are a number of tools in the data science repertoire. For the purpose of this blog, we will focus on the tools one can learn through the Microsoft MPP courses.
Analyzing & Visualizing Data
The first tool we look at is for analyzing and visualizing data. The MPP course gives you a choice between working with Power BI or Excel. As I have previous experience with Excel and feel pretty confident there, I chose to learning something new and went with Power BI. I found Power BI to be a super fun tool that felt more like a video game and less like work. I love a good visual! This class easily walked me through setup and through a variety of use-case scenarios. I found it very fun and easy to learn. In fact, what struck me the most about these classes is how very concise yet easily followed the class are.
Communicate Data Insights
Now that you understand the basics of analyzing and visualizing data, it is important to know how to master data communication. It is one thing to be able to look at data and understand it, it takes a completely different set of skills to convey the stories the data has to tell. In the next course, Analytics Storytelling for Impact, you will learn how to fully explore a story to find what a great story is, and what it is not. This course really dives into how to make an impact through storytelling and gives you an idea how to create impact through presentations, reports and how to apply these skills to your data analytics. I thoroughly enjoyed this class as it spoke to the theater major in me. I do love to tell a good story, and this class gave me new ways to look at data and has resulted in me questioning things I see every day like political polls, job descriptions, and advertisements.
Apply Ethics and Law in Analytics
Ethics? What does ethics have to do with being a data scientist? Admittedly, when I first saw that the program had been updated with Ethics and Law in Data and Analytics I was a bit taken aback. I thought I had left the legal field and was on the way to a technical role. Why learn ethics? Data science, and data collection have changed wildly and quickly over the last few years. It is my firm belief that every data professional needs to take this course. Only through taking this course I learned about the possibility of data being accidentally prejudice! Certainly ethics should be considered when collecting and analyzing data! The data scientist would be remiss in not heeding due diligence!
Query Relational Data
The data scientist must know how to query databases in order to get the data needed to analyse. The MPP program offers Querying Data with Transact SQL where you will learn to query and modify data in SQL Server or Azure SQL using TSQL. If you are not familiar, SQL is pronounced in the industry as “See-Quil” not “Es-Que-El”… it is a pet peeve of mine to hear someone say S-Q-L when talking to me about SQL. This course was very thorough and a great way to step into learning how to query and program using TSQL. This class will take some effort, I found it to be one of the more intensive classes in this course. SQL is no easy task, and SQL Server has many versions out there in practical use, each version with different hurdles to jump. This particular class is a fantastic place to start to learn a great deal about SQL.
Explore Data with Code
The next step in the program is to explore data with code. You are given two options here, one path is Introduction to R for Data Science and the other is Introduction to Python for Data Science. For my interests, I chose Python since it is widely used in many areas, especially advanced analytics and AI. To my surprise, Python was a lot of fun to learn. I did more research into the uses of Python and found it to be a very useful tool in my toolkit. I can even design and program holiday lights for my house using Python!
Apply Math and Statistics to Data Analysis
Whoa, wait….math? Math is involved??? Yes, absolutely! Remember back in school when you thought “When will I EVER use this again in real life?” The answer is “Now, and always, honestly.” There are three classes offered here so you can choose which you want to learn:
I chose the Python Edition to continue on my usage of Python from the last class. I was not a great math student, so I was really afraid I would not be smart enough to get through this class. If you are feeling that way, stop that now. Like I have said before, these classes are designed in such a great way that not only was I able to learn and grow, I made a great grade! Don’t let fear of failure keep you from trying something new.
To be honest, I faced this particular class with dread. Much to my surprise, I really and truly enjoyed learning about building machine learning models. You can chose between Principles of Machine Learning: R Edition and Principles of Machine Learning: Python Edition . If you have previously chosen Python as I did, continue on with that path. This class offers a clear explanation of machine learning theory through hands-on experience in the labs. You will use Python or R to build, validate and deploy machine learning models using Azure Notebooks.
I will make one suggestion though, before completing this class, would recommend completing Developing Big Data Solutions with Azure Machine Learning . As a more visual-based person, I found that I understood the machine learning models much more after completing the course using Azure Machine Learning.
Build Predictive Solutions at Scale
Okay, now we are getting to some really fun stuff! I think this was my absolute favorite of all the classes. You can chose from one of these three:
I chose Developing Big Data with Azure Machine Learning (AML) and what a blast I had! I can say that working with AML and with Azure Data Studio was like opening up presents on my birthday! The final projects were a lot of work, but I got a real sense of what working in the field as a data scientist and machine learning is all about… trial and error. It was a lot of fun trying to use insights, hunches, best guesses, and technology all together to create and train a model in order to accurately predict solutions!
After all the courses are completed and passed, you can only gain the MPP in Data Science if you successfully pass the Microsoft Professional Capstone : Data Science. As of the writing of this blog, I am slated to begin the Capstone on December 31, 2018 and I cannot think of a better way to ring in the new year!
I have researched many ways to become a Data Scientist. Most universities offer degrees in data science. I have found that on the majority of their sites, they tout a Masters or PHD in Data Science is what you need (with a heavy prerequisite of extensive math and stats classes) in order to become a data scientist. Must you have an advanced degree in mathematics or engineering to become a data scientist? Absolutely not. You don’t even have to hold a degree to work as a data scientist! Take a look at this article published on Forbes: 4 Reasons Not To Get That Masters In Data Science
My advice is to take a look at the Microsoft MPP program and try on a few of the free classes. If you are truly interested in a data science career and are willing to put forth the time and attention needed to learn, you already qualify as a good candidate. Don’t let your past dictate your future. Make the investment in yourself and grow along with the technology as it comes. You can do this!
In 2016 I decided to switch careers. It was a big decision for me, and a bit scary. I had been working in the legal field since I graduated college. It was an area in which I grew and excelled, but it had become stale for me. I knew I wasn’t happy and that my heart truly belonged in a more technically challenging role; however, I had no idea where to start. In walked the Microsoft Professional Program in Data Science. Knowing that I learn best from structured programs, I felt confident that the classroom-style lecture and lab would be a great fit. I found it interesting and so very helpful that the program allowed me to first dip my toe into tech without too much investment. The courses are designed so that you can audit them and not pay a thing! Soon, I was hooked!
Jumping Into the Deep End
Standing on the precipice of greatness is often an exciting and terrifying experience. I was staring at the data field trying to decide which path to take. I was fortunate to be offered a position as a Marketing DBA with a SQL Server consulting company. I began learning and growing as a website designer, marketing specialist, CRM specialist, project manager, event coordinator, and database administrator. Wow, there is SO much to learn! So many paths to take, so many areas in which to specialize. It can be very overwhelming to the new student. After a while I found that I needed to make a move, I needed to take a big leap into the well of knowledge, so I left my position to go back and finish my data science courses.
What is “data science?” you may be asking. I feel that the term is both widely over-used and wholly underestimated. Think about the term “scientist” and what that entails. A scientist uses knowledge and technology to aid in finding an answer or truth. So naturally, a data scientist does the same thing. We use various means of technology, finesse, hypothesis, and trial and error to draw from the data the story that is waiting to be told. My personal journey took me along the path of learning Python and Azure Machine Learning, two things I never knew I would have any desire to learn. I faced these two courses with extreme skepticism. I resigned myself to the thought that I would more than likely barely get by (and that “D stands for diploma” as we used to say in college). Much to my surprise, I found that not only did I understand both, I absolutely fell in love with coding in Python and predictive analysis with Azure Machine Learning!
Data science really is so much more than exploring with new technology and programs. Data science also draws on the tried and true, and this is where my background in the legal field comes in really handy. For instance, can pure, raw data be racist? (Absolutely, and that will be a topic we save for an up-coming blog. ) When we are given data, it is never in a perfect form: information is missing or duplicated, in the wrong format, or is unusable for many other different reasons. Because of this, a data scientist must know how to clean, manipulate, and finesse the data in order to pull out the story. Much like an artist who starts with a block of marble creates a beautiful statue, Data scientists must know the curves and lines to manipulate to bring out the best form of the story the data is designed to tell. In the legal field, we used words and phrases to convey law, tell stories of guilt or innocence, and create the narrative we desired. It all takes an art form and knack for finesse.
There will come a time in life, perhaps many times, where you find your cheese has been moved. Many times as well that you find you need to move your own cheese. “Why the cheese talk Angela?” you might be pondering. Back in my first job out of college, an attorney I worked with gave me a copy of a book he wanted me to read. It was truly an eye-opening and life changing moment for me. If you have not read the book Who Moved My Cheese ? I suggest getting a copy. It is a good reminder that no matter where you are in life, if you are not living happy and feeling fulfilled, you CAN make a change. You can wait for someone else to move your cheese, or you can just move it yourself. So this is what I did, I moved my cheese.
I invite you to come along on this data science journey with me as we explore new programs and technologies. Over the next few weeks, I will be diving into my experience in each section of the Microsoft Professional Program in Data Science. I will NOT be sharing answers to test or lab questions, that would be cheating, and you know that as well as I do!
In 2016 I attended PASS Summit for the first time as an attendee. I was so excited and a little nervous. I was certainly not a stranger to the #SQLFamily but I was not really a member of it yet either. My intentions were to attend Summit, make a few connections, and learn as much as possible. I had no idea what was truly in store for me. I had no idea that some of the people I met there would become some of the best friends I have ever known. I had no idea how my life would be so enriched by meeting and talking to so many diverse people.
Talk to People!
As many conference go-ers will attest, the most important thing you can do at a conference is networking!
Look, Summit is like going to Disney World; there is NO WAY you will be able to see everything in that short amount of time. So prepare accordingly and schedule in time to spend in the community center and go out to after events. Summit is much bigger than you can imagine. I knew I would meet some really cool people and would learn some amazing new things. What I did not know is how vastly those three days would change my life!
I am very much a people-person gifted with the ability to talk with anyone about anything at any time. I am an extrovert. I thrive on meeting new people and gaining insights from others. While this is totally me, I understand that many in the tech industry are not as ready and willing to start a conversation. I am on a personal mission this year to help people enrich their lives one conversation at a time.
Calling all Introverts!
Sometimes all people need is an invitation to start a conversation. My trick to starting conversations with shy people is to notice something we have in common, or to compliment something about them. Example: “Hi! That is a great bag!” or “Oh hey, my hair is the same color as your backpack!” These are small little introductions that may or may not lead to a full-blown conversation and maybe an eventual friendship!
You may be sitting there growing steadily uncomfortable at this thought. (Mmmmm hmmm, I can see you squirming in your seat!) Don’t fret. I can help you! If you are not ready to be the one who reaches out, give others something to start a conversation with you! At Summit this year I will have a limited number of buttons to hand out to my Introvert Entourage! Come meet me, ask for a button, and join the coolest group around: “Angela’s Introvert Entourage!”
Come walk with me and we can meet new friends together!
*You don’t have to be an Introvert to receive a button. You can be an Introvert Ally!*
When creating reports in Power BI, the end goal is to make them useful to many users. In order to share reports created in Power BI, they must be published to the cloud (known as PowerBI.com). Once nestled in the cloud, the data in the reports will either stand static, or will need to be updated on a regular basis. In order to refresh data and keep end users in up to date, the cloud must have access to data sources. This is where you need a Data Gateway. Think of a data gateway as a bridge between your on-premises data sources and the cloud.
A gateway should be installed on a machine that is always on and connected to the internet. Gateway cannot access information from a machine that is powered off or loses internet.
Before installing, take into consideration that if you are installing on a laptop and it is turned off, not connected to the internet, or asleep, the gateway won’t work and the data in the cloud will not sync with your on-prem data. Also, if the machine on which the gateway is installed is connected to a wireless network, the gateway may perform more slowly and that will take longer for the data to sync with the cloud and your on-prem data.
Power BI Gateway can be installed in two ways:
On-premises data gateway – This gateway can be used by any user that has access to the server on which the gateway is installed. It can be used for scheduling refreshes and live queries.
On-premises data gateway (Personal mode) – This gateway is can only be used by the person setting up the gateway. This mode is only used for scheduling refreshes in Power BI. At the time of writing, Live Connection, DirectQuery, Power Apps, Logic Apps, and Microsoft Flow are not supported.
Only one gateway in each mode can be installed on one machine. That is, you may install one gateway in personal mode, and another in regular mode. You cannot install two or more personal mode gateways on one machine. You can, however, manage multiple gateways from the same interface on Power BI.
Installing a Gateway
To install a gateway, you will first need to sign on to PowerBI.com. Take note that this is NOT the desktop app, this is the cloud-based service. Look at the top right on the menu bar, click on the icon that looks like an arrow pointing down. The dropdown will reveal several actions. You will want to choose ‘Data Gateway’.
This will take you to a new webpage where you will be able to start your Gateway download. Click on the DOWNLOAD GATEWAY button and wait for the download to begin. Once the Download Installer has finished, open up the exe and follow instructions.
When the installer opens, you will be ready to start setting up your gateway.
Click NEXT to choose the type of gateway you need. Before you choose, take into consideration the role of each. Remember that the Personal mode is only useful for on demand refresh and scheduling refresh in PowerBI and cannot be used for Live or DirectQuery. On-premises data gateway can be used by multiple users and does support both schedule refresh and DirectQuery.
Please note the following in regard to installing either mode:
both gateways require 64-bit Windows operating systems
gateways can’t be installed on a domain controller
you can install up to two On-premises data gateways on the same computer, one running in each mode (personal and standard)
you cannot have more than one gateway running in the same mode on the same computer
you can install multiple On-premises data gateways on different computers, and manage them all from the same Power BI gateway management interface (not including Personal mode)
You can only have one Personal mode gateway running for each Power BI user. If you install another Personal mode gateway for the same user, even on a different computer, the most recent installation replaces the existing previous installation.
Once you have chosen your mode and clicked next, it will take a a few seconds for it to download and get ready to install your gateway.
Upon successful installation, you will need to add an email address to use with this gateway. Next you will need to sign in.
We have successful installation of our Gateway! Now you will have the option to configure a new gateway, migrate, restore, or take over an existing gateway. Here we will register the data gateway.
To configure a new gateway, you will need to enter a name for the gateway, enter a recovery key (minimum 8 characters) and finally, select Configure. Be sure to store your recovery key in a safe place.You will need it in the future if you ever need to migrate, restore, or take over a gateway.
Congratulations, you now have a successful installation and configuration of Gateway! Now you will be able to connect to on-prem data sources! For use with Power BI, you will need to add your data sources to the gateway within the Power BI service. This is done by going to the menu bar, clicking on the gear icon and choosing MANAGE GATEWAYS from the drop down. We will cover adding data sources in the next blog!
*For a more in-depth look at Gateway installation, information can be found on Microsoft Docs.