Skip to content

The Chicago Data Visualization Group (CDVG) focuses on learning, creating, sharing, & promoting data visualization in Chicago

Great year in 2014, looking forward to 2015

January 13, 2015

Tom

Looking back, 2014 was a great year for data visualization in Chicago and our group. The group met 10 times this past year in a mix of workshop and seminars. Group membership grew from 700 to 1,400 members over the year. I wanted to review what we did last year, where we excelled and where we fell short. Even though it was a wonderful year for the group, 2015 is an opportunity to grow and offer even more to those interested in data visualization in the Chicagoland area.

City of Big Data

Given my role in Chicago’s government, the city (the entity of residents, companies, and culture; not just the government) tended to be showcased. The City of Big Data exhibit at the Chicago Architecture Foundation was a grand exhibit of the intersection of urban spaces, data visualization, and data. Computer-based 2-D and 3-D models were featured, and a real-life 3-D projection of data onto a scale replica of downtown Chicago.

Chicago: City of Big Data

Chicago: City of Big Data

The organizers of the exhibit were exploring how data is not a spreadsheet or abstract notion, instead, it surrounds us. Some of this data is public, highlighted by its display on the 3D model. But it’s also private, captured by Facebook activity and streaming music. In all, it surrounds us in the same way it does in the exhibit.

Data Science for Social Good is a relatively new program that puts highly-qualified and trained undergraduate, graduate, and post-graduate students in Chicago working on research problems to improve society. Many of these fellows were part of the group over the year. This fall, we were able to see some of those projects which are mostly available as open source projects.

It was obvious that Shiny was a popular tool to quickly prototype and build interactive programs and maps for end users. Grand visualizations were not a primary outcome of those projects, but because of the operational nature of these systems, clear, concise visualizations and maps were rolled out. While visualization is often associated with impressive, large displays, some of the most crucial visualizations follow good practices on charts and maps, enabling exploration through advance filtering. While grand visualizations are the most impressive, the most important visualizations are well-executed visualization of maps and simple graphs.

Importance in design

One of my favorite meetup concepts in 2014 was exploring how design choices leads to a different user experience. This past April, Heather Billings of Chicago Tribune and Derek Eder and Erik Van Zanten of DataMade shared how they visualized the crime data located on the City’s open data portal. It’s a popular dataset from the number of rows (over 5 million) and the pertinent nature of the topic. There seems to be hundreds of analysis and graphs of the data, but the dataset is only 22 variables (columns)–how can the same data look so different?

crimeinchicagodotorg

Each one of the designers took a different approach to visualizing crime data. Chicago Tribune focused on crime in the neighborhood. Derek was candide in admitting that the project was an exploration and learning project as an intentional visualization. Erik demonstrated a tool that allowed people to define their own area of interest. Each project used the exact same data source, but the design and visualization choice led to different user experiences and outcomes.

The city is often the subject of research, an easy topic since it’s a topic that literally surrounds us every day. The newly launched bike-share program led to the DIVVY Data Challenge was quite successful. Similar to crime data, there were a number of drastically different designs based on the same fundamental data, so how different of a design and experience yields from the same data? In this case, there were 99 submissions, all with their own design.

Viva Voronoi!

Viva Voronoi!

We looked at Divvy Spokes, a chord diagram showing neighborhood-to-neighborhood connections by Shaun Jacobsen; thorough statistical analysis by Drew DePriest on ride patterns and his struggle with seemingly messy data; custom dashboard (with crossfiltering!) by Michael Freeman; and learned Voronoi tiles from Gabe Gaster on his map. Additional “show and tell” presentations by KK shows the usefulness and ease of Tableau and–in one of the most amusing apps–find your match on DIVVY with Serendivvity. The same dataset was literally ranging from a dashboard to a dating site. Ultimately, the design and visualization choices leads to an entirely different experience.

Keynotes

We were treated to nationally renowned data visualization experts. Alberto Cairo previewed his upcoming book and his perspective on the role and epistemology of data visualization. Kaiser Fung, the embodiment of an art critic turned data visualization aficionado, reviewed the best and worst charts–and what makes a good chart, good. Both Alberto and Kaiser, leading speakers in this area, were able to cap off a productive year with outstanding talks.

Review of 2014 goals

Early in 2014, I noted three ways to facilitate a robust meetup group: (1) seminars, (2) teach & learn (workshops), and (3) show & tell. Seminars were a strong part of the year. Visits by Alberto Cairo and Kaiser Fung were enlightening. Generally, we were able to hear from a speaker almost every month. Seminars are a clear staple of this group and will continue to be in the future.

D3 Workshop at Critical Mass

D3 Workshop at Critical Mass

The show & tell sessions were also repeated throughout the year. The DIVVY meetup was particularly wonderful as some great data visualizations were presented during very short presentations. It was successful enough this year where I aim to deliver more of these opportunities in the future.

Sadly, this puttered out throughout the year and I do not think we achieved our mission of providing workshops and conducting adequate hands-on training. We started strong with workshops on D3 (many thanks to Paul Katsen for a wonderful job!), but scheduling started to interfere. We are going to renew our efforts to offer workshops in 2015.

Looking forward to 2015

We will continue to offer the three areas of focus–seminars, workshops, and show & tell–from 2014. I hope to improve the number of workshops that can be provided to the Chicago community. But, in addition, we will take the opportunity to improve upon the past year. Thus, the next year will include new focuses that weren’t covered in 2014.

Data visualization is frequently embodied by business intelligence tools, used by large and small organizations. We will be spending some time this year focusing on business intelligence platforms. There is an unfortunate vacuum in the Chicago community around BI platform awareness.

Our first meetup (TBA) will discuss the Tableau platform. As we learned from the Data Science for Social Good group, some of the best data visualization is good execution of basic graphs. The rise of “data exploration” platforms have focused more on visualization and discovery than automating or “pixel-perfect” reports. We will introduce these platforms and discuss the pros and cons of these tools.

Data visualization is also establishing itself as a profession. Dedicated data visualization-ists(?) are being sought on job boards. Thus, we will be more considerate on making a connection between employers and potential employees. Late in the year, we introduced an opportunity for employers to note they were hiring. We will continue that practice. Likewise, we will increase the offerings for serious professional development in the group.

These lists are never exhaustive, but is a start of a plan for 2015. I am interested to hear your feedback through this blog, @ChicagoDataVis, chicagodataviz@gmail,com, and at the events.

Postponed D3 Workshop

April 1, 2014

Tom

Unfortunately, due to last minute scheduling issues, we will be postponing the D3 workshop planned for Thursday, April 3rd. We hope to reschedule this meetup soon and we apologize for the inconvenience.

Hi, I’m Tom.

March 27, 2014

Tom

Tom Schenk Jr.

Josh Doyle has done a remarkable job at establishing the Chicago Data Visualization Group in the city. Just a couple of years ago, noticing a gap in Chicago’s active peer network, he rightly identified a larger interest in data visualization. As he noted, Josh will be active in the community but I will be stepping-in to organize the group’s meetups. I am very excited that I have an opportunity to spend a little time working with this group. In large part, I am relieved that Josh has done the hard work of keeping a new group active in it’s initial stage. In the upcoming year, I have the luxury of building upon previous work.

What will that work entail? Namely, I want to increase the number of opportunities for members through the meetup group. We’ve begun scheduling a pair of events each month; a workshop early in the month and a seminar or presentation at the end. It would be ideal to keep this pace throughout the year, if not more.

There is a general approach to upcoming events. In addition to increasing frequency, I hope this group can provide a rich variety of meetups in these areas:

  1. Show & Tell
    Chicago’s local talent has produced superb visualization, many of which feature Chicago itself. Companies like Datascope Analytics, Datamade, Chicago Tribune, and others actively use data visualization as a means of effective communication. To further the local community–our community–I hope we can share work through a show and tell for every meetup.

    This was introduced as “lightning talks” in the most recent meetup, but will continue to exist as a show & tell.In addition to showing completed works, I hope members will be brave enough where work-in-progress can be shared to solicit feedback in the development process.Want to give a show & tell? Bring a laptop (and dongle) that can connect to a VGA cable and be prepared to talk. For now, we’ll keep the lightning talk format to a 5-minute limit.

  2. Teach & Learn
    Data visualization is a profession that involves skill and toolset. We are fortunate that many of us work in a growing software development, UI/UX, data analytics, and general IT fields. But it also leaves limited opportunity to learn in a rapidly changing field. This year we will be offering a series of free workshops on various data visualization tools. Paul Katsen has held the well-received D3 workshops.

    In the upcoming year, we aim to expand to other tools, including R and more D3. We also would like for you to lead sessions. With over 800 members in our group alone, there is sufficient talent that could be taught to others from our own ranks. Always feel free to reach out with a new idea for a workshop that can be useful for budding and experienced data visualization-ists.

  3. Seminars
    We already have had great speakers, such as Andy and Cole, visit us in Chicago. I hope to bring more leaders in visualization to our meetups. There are some exciting speakers that are being confirmed for later this year.Local talent is also important. Chicago-based Datascope Analytics have presented to our group, as will Datamade, Chicago Tribune, and a variety of people who participated in the DIVVY Data Challenge.

    In May, we will be holding an event at the Chicago Architecture Foundation’s City of Big Data exhibit. I hope we can continue to feature data visualization from our own community as featured seminars.

This isn’t an exhaustive list, but a start to a good year. I am interested to hear your feedback through this blog, @ChicagoDataViz, chicagodataviz@gmail,com, and at the events.

Graphing in R & ggplot2 workshop off to a great start!

October 10, 2012

Tom

687474703a2f2f6879706572737065632e722d666f7267652e722d70726f6a6563742e6f72672f7370632e706e671_th.png

Thanks to the 30+ CDVG members that came out to the first night of the Graphing in R & ggplot2 workshop at 1871. It was a great start and I look forward to doing more of these in the future. I’ve updated the presentation and posted it to slideshare. Here is a highlight of the changes:

  • added link to a matrix multiplication tutorial on the Matrix: Arithmetic slide (p. 26).
  • changed the data frame subsetting example to access columns by name instead of index number.
  • cleaned up typos.
  • added references page.

Free copy of the R Cookbook for the first person to post and share a valid solution to any one of the exercises on slide number 36. You can post on your own blog, on our CDVG blog (contact me) or Github(recommended). just let us know where you put it.

Another reminder that we don’t have the IMSA classroom on October 22 and we will either push the schedule a week or just meet in the shared areas of 1871. We will make that decision before leaving 1871 this coming Monday.

I look forward to seeing everyone next week when we will start using ggplot2 under the tutelage of Tom Schenk. Bring $5 if you want to have pizza that night. Drinks are BYO. I’ll bring a few extra beers for anyone that wants to do a beer swap. It’s a great way to discover something new to drink.

See everyone Monday.


Thanks,

Josh Doyle

CDVG members will meet to discuss free online data visualization course offered by Knight Center.

October 10, 2012

Tom

Alberto Cairo and the Knight Center for Journalism in the Americas is offering a free online introduction to data journalism and infographics course in October. I am signing up and I hope that some of you will join me. If you want to discuss the course materials with your fellow CDVG members also taking the course then sign up for the meeting at our meetup site.

Meeting Topic Suggestion: Real-time data flows and updates with no lags in order to make better decisions on fresh data

October 1, 2012

Tom

CDVG member Adam Kallish recently proposed:

Real-time data flows and updates with no lags in order to make better decisions on fresh data

as a topic for a future meeting.

Please express your interest in this topic by commenting below.

Meeting Topic Suggestion: Temporal meaning of data depending on the context of access and use

October 1, 2012

Tom

CDVG member Adam Kallish recently proposed:

Discussion of the temporal meaning of data depending on the context of access and use (work/home/travel)

as a topic for a future meeting.

Please express your interest in this topic by commenting below.

Meeting Topic Suggestion: Cognitive and workload performance increases through data visualization

October 1, 2012

Tom

CDVG member Adam Kallish recently proposed:

Discussion of cognitive and workload performance increases through data visualization and interacting with data

as a topic for a future meeting.

Please express your interest in this topic by commenting below.

Meeting Topic Suggestion: Platform implications for data visualization

October 1, 2012

Tom

CDVG member Adam Kallish recently proposed:

discussions on rich internet applications and discussions when browser based, or fat client/RIA applications for multi-modal platforms (laptop/tablet/phone/hybrid) are best applicable for a particular solution

as a topic for a future meeting.

Please express your interest in this topic by commenting below.

Workshop: Graphing with R & ggplot2

September 28, 2012

Tom

Descriptionggplot2 example by Hadley Wickham

Introduce yourself to R and the powerful graphing library based on the Grammar of Graphics–ggplot2. Attendees will work in small teams to learn how to generate basic and advanced plots in ggplot2 to solve a variety of problems. The workshop will also review the fundamentals of data visualization to increase the readability and clarity of plots.

The workshop is open to all types of users, including those who are unfamiliar with R. We will mix some demonstration with small group-based projects. Basic principles of data visualization will also be emphasized alongside ggplot2 demonstrations to put the program into a larger context.

Audience

The workshop is targeted to individuals who are not familiar with ggplot2, including beginners who are new to the R software. Attendees will need to bring there own computer where we will install the R and ggplot2 software–don’t worry, both are open source and free.

Time, Location, & Signup

The workshop begins October 8  in the IMSA classroom at 1871 located on the 12th floor of The Merchandise Mart (222 W. Merchandise Mart Plaza). It includes four sessions (outline below) meeting on consecutive Mondays at 6pm. The IMSA room isn’t available on October 22 so, depending on the number of attendees, we will either meet in a smaller conference room or push the schedule a week.

There are only 30 seats available for this workshop due to the size limitations of the IMSA classroom. Interested attendees need to go to the CDVG meetup site to sign up for each of the four sessions.

Workshop Leaders

The workshop will be led by CDVG member Tom Schenk. Tom is a Senior Research Data Analyst at Northwestern University, Department of Medical Social Sciences. You can read more about Tom on his website. He also curates Data Nouveau–a collection of interesting data visualizations on the web.

Tom will be assisted by CDVG member Josh Doyle (who is relatively new to R & ggplot2 and will ask the dumb questions so others won’t have to). We also expect to have some other experienced folks in the room to help out.

Workshop Outline

Introduction to R (October 8)

We will familiarized ourselves with the R environment with a gentle introduction to the basic functions. After installing R, we will import and inspect data sets while becoming familiar with R terminology. By the end of the class, we will conduct basic descriptions and plots of the data.

  • Learn how to import data into R.
  • Understand the structure of data sets and their components..
  • Learn how to describe data.
  • Download and install new packages from CRAN.
  • Plot data using basic R functions.

Introduction to ggplot2 (October 15)

We will begin to use the ggplot2 package to create basic, but handsome, univariate, bivariate, and time-series graphs. We will introduce the functions and terminology used in ggplot2. We will also explain the fundamentals of proper data visualization techniques and how it relates to the ggplot2 defaults.

  • Install the ggplot2 package.
  • Use geometric shapes to display data.

Grammar of Graphics (October 22 or 29)

We will continue to show more advanced features of ggplot2, including how it relates to Leland Wilkinson’s Grammar of Graphics. We will show how to plot more than 2 variables in a single graph using colors, shapes, and sizes. We will also discuss how human ability to perceive different shapes and colors should drive the choices we make in data visualization.

  • Using scales to add information.
  • Using coordinates to aid interpretations.
  • Easily create small multiple graphs.

Plots for Publications (October 29 or November 5)

After learning how to make plots, we will learn how to customize graphs with custom colors, labels, and themes. We will emphasize how to create a customized look to be included in publications, including adding labels in diagrams to help readers.

  • Saving graphs from R into publication-friendly formats.
  • Use custom colors for plots.
  • Use your own fonts.
  • Customizing ggplot2 graphs with the new themes feature.
Follow

Get every new post delivered to your Inbox.

Join 56 other followers