The Renegade Coder

What Is Snake Case in Python?

Jeremy Grifski — Fri, 26 Jul 2024 14:00:00 +0000

While Python may be the name of both a programming language and a snake, snake case is an unrelated concept stemming from the way we name things in programming. In the rest of this article, we’ll take a look at what snake case is, when you would use it, and even why you should use it. Let’s dig in!

Concept Overview
When to Use Snake Case in Python?
Why Follow Snake Case in Python?
What Naming Conventions Do Other Languages Use?
Learning About Conventions

Concept Overview

By far, snake case has to be one of the lowest hanging fruit concepts I’ve ever written about, but we’ll see just how hard of a concept it is to cover.

To start, snake case, which is sometimes written as snake_case as should be obvious shortly, describes lowercase phrases where the spaces between words are replaced by underscores. For example, the phrase “red apple” would be written as red_apple in snake case.

In the context of Python, snake case is used for naming things, like functions and variables. Therefore, snake case is what’s known as a naming convention. In general, there are many naming conventions that vary from language to language and even from team to team. But, I would argue that snake case is fairly universal in the Python community. I usually find it a bit odd when it’s not being followed.

Alternatives to snake_case include camelCase, kebab-case, and flatcase, some of which you may even see used in Python for special cases.

Naturally, I’m sure this short introduction left you will some questions, so let’s get to them!

When to Use Snake Case in Python?

When writing code in Python, you will have plenty of opportunities to name things. For example, whenever you create a variable, you will have to give it a name. The same goes for functions, parameters, modules, classes, and constants. But, not all of them should make use of snake case.

Explicitly, snake case is used when naming variables, functions, parameters, and modules—as in my_variable, my_function, my_parameter, and my_module, respectively. In contrast, upper camel case is used when naming classes, as in MyClass, and upper case is used with constants, as in MY_CONSTANT. Here’s that same information in a table:

Python Context	Casing
Functions, Variables, Parameters, and Modules	snake_case
Classes	UpperCamelCase
Constants	UPPER_CASE

Generally, I would not expect to see camel case, kebab case, or flat case anywhere in Python code, but I’m sure people do it. Seeing camel case is usually a way of detecting a Java programmer, as I used to be one myself.

Also, I should mention, as rzuckerm has pointed out in the Discord, that several of the Python core libraries disobey this convention, such as logging and threading. Fortunately, the Python developers are aware of this and have carved out an exception for camelCase (or mixedCase as they call it) in PEP8.

Why Follow Snake Case in Python?

The question of why you should follow snake case or any rules, really, is more of a philosophical debate than a practical one, but I’ll entertain it.

One possible argument would be one of efficiency. Typically, developers want to eliminate as much cognitive overhead as possible when reading code, and naming conventions are one of our ways of doing that. If you can quickly tell what a name means by glancing at it, you don’t have to bother hunting down its origins.

Another possible argument would be one of polish. As a craft, software development deserves as much care as any other craft. For example, how many of those videos have you seen of rushed construction jobs where tools and junk are left hidden in the walls of a house? Now imagine your users seeing your source code that does not subscribe to any rules.

Perhaps another argument could be around teamwork. Generally, this argument ties into efficiency, but it’s normal to want to use a shared language when you communicate with your team. Believe it or not, code is a form of communication and using the right language (e.g., snake_case in python) shows that you’re a part of the ingroup (i.e., a real software developer).

Lastly, one argument could be around selfishness. When I teach folks how to code, it can be hard to get students to subscribe to conventions, especially if they have previous coding experience. My trick to get folks to care about rules is to center them in the conversation. Forget some hypothetical team. You’re the one who is going to be reading that code again later, so why not do your future self a favor?

What Naming Conventions Do Other Languages Use?

While this is a wonderful question, I cannot even begin to answer it. Not all languages have a strong community around them or have similar philosophical leanings that would cause the community to adopt certain conventions.

That said, there are a few conventions I know off the top of my head. As I said previously, Java is somewhat famous for following the camel case convention for everything but classes, which follow upper camel case, and constants, which follow upper case.

In a similar vein, C# follows a lot of the same conventions as Java, as they are very similar languages. Though, I’m sure there are minor differences, like the way the C# community prefers to place their braces. Likewise, I would suspect that Kotlin also follows the same conventions.

It’s been some time since I’ve used many C-style languages, such as C, C++, and even JavaScript, so I don’t recall the naming conventions in those spaces. That said, I consider those languages and their communities to be a bit of the wild west, so I would imagine there aren’t many universal conventions.

Beyond that, I will say that the Sample Programs repo, which contains code snippets in a bunch of programming languages, has a spot to declare the casing of at least the file names for every programming language in the repo. So, you might take a peek there!

Learning About Conventions

Generally, programming languages, like Python, are going to have community-driven conventions. These conventions can be subscribed to in whole or in part depending on the context. As a result, you may find that some of your personal conventions are banned in certain teams, and that’s okay! Every group is going to have their reasons for why they prefer certain conventions over others. These cause arguments every day in our communities, so I don’t expect them to stop anytime soon.

As a matter of fact, I’ve been involved in a few of those debates myself, which you can read about below:

With that said, know that many of the strong opinions you’ll hear from people in tech are just that, opinions. As far as I know, the conventions we hold dear have rarely been researched, if ever, so there’s little to no empirical evidence driving these debates. Though, it looks like there may be some research coming down the pipeline, which is exciting!

Also, while you’re hear, why not browse some of the following resources (#ad):

Likewise, you’re welcome to take your support a step further by checking out my list of ways to grow the site. Otherwise, take care, and hopefully I’ll see you again soon!

The Name 100 Women Challenge

Jeremy Grifski — Fri, 19 Jul 2024 14:00:00 +0000

All But Dissertation: The Light at the End of the Tunnel

Jeremy Grifski — Fri, 12 Jul 2024 14:00:00 +0000

It’s July 2024, and I have three chapters of my dissertation drafted! Two more and we’ll be ready to defend.

What Does It Mean to be All But Dissertation?
What’s Left to Do?
Did You Find Anything Interesting?
What Can We Do With This Knowledge?

What Does It Mean to be All But Dissertation?

It’s been some time since I’ve written about my PhD journey. After all, quite a lot has happened since 2018, and I’ve been so sucked into the grind that I’ve forgotten to share my progress. As a result, I wanted to spend some time today talking about my current stage in my PhD program, which is often affectionately called “All but dissertation” or ABD.

To illustrate what ABD actually looks like, here’s a little recap of some of the major milestones in my PhD journey:

Since passing my candidacy exam, I’ve conducted my entire study, and all that’s left to do is write up my dissertation. That’s what we mean when we say ABD.

What’s Left to Do?

At the time of writing, I’ve drafted three of the five chapters in my dissertation. Specifically, I’ve written the literature review, the methods, and the findings chapters. The last two chapters are the introduction and conclusion chapters. I estimate about two more months of writing before I have a full draft of my dissertation. Therefore, I am planning to defend my work in the fall and hopefully graduate on December 15th, 2024.

In the meantime, I’m going through feedback cycles with my advisor and my committee. At the moment, I’m feeling pretty good about my progress, so I expect to have a full draft around the time fall semester begins.

At that point, there is somewhat of a painful process that begins. For example, I have to declare my plans to graduate by September 6th. The defense then has to occur by November 22nd, and the final dissertation document has to be submitted and accepted by November 27th. In addition, the dissertation document has to be reviewed for accessibility somewhere along the way to submission.

Did You Find Anything Interesting?

Glad you asked! At this point, I haven’t really talked a lot about my work on this site. I think the closest I’ve done is talk a little about ethnography. Regardless, the short description is that I looked at student values and how they compare to their beliefs about the values of their institution. In this case, the students we’re talking about are undergraduate computer science students, and the concept we’re looking at is called value congruence.

What I found is perhaps not going to be all that surprising. In general, there is alignment in values when it comes to getting students into the classroom (i.e., students want to take classes and institutions want students to take them). However, once students are in those classes, the alignment in values breaks down.

To illustrate what this looks like, students value a much more diverse set of resources than they believe the institution does. These resources range from relationships like friends and family to practitioner resources like Stack Overflow or ChatGPT. On the other hand, students believe the institution only cares about institutional resources like faculty, homework, and textbooks.

Ultimately, I don’t find the results particularly surprising. You hear students say these sort of things all the time when they are rightfully critical of their education. However, it is nice to have some empirical evidence of this reality that we can point to.

What Can We Do With This Knowledge?

One of the arguments I’ve made in my dissertation is that undergraduate computer science education needs to be reformed. I make this argument on the premise that others have made the same argument before me and that there are many recent issues in the literature that could also benefit from reforms.

I take that argument a step further by centering students. Specifically, I argue that students should be the primary benefactors of any educational reforms, and I justify that in a variety of ways: there are more students now than ever before, students face a lot of challenges, and addressing these challenges would result in positive benefits for institutions more broadly.

I then extend that argument to say that we should leverage value congruence as the lens by which we prioritize students in reform efforts. The majority of this argument hinges on the known benefits of value congruence in the business world, and I’m just mapping the results onto computer science education.

All of this is to say that the results of my study give us ways that we can construct reform efforts that improve the experience for students. In turn, institutions will see a variety of benefits like improved retention and pass rates. Ultimately, it’s a win-win for both parties.

At the moment, I’m not sure what reform efforts would look like explicitly, but I imagine some of them will lean in the direction of adopting a more community-based education focus as well as more real-world curriculum. These are things that are already evidence-based. My work just provides another way of arguing for their inclusion in computer science education.

With that said, I don’t want to give all the goodies away just yet! I’ll update y’all with where things are once I’ve successfully defended my work. As a result, look out for an article titled something like, “lessons learned from earning a PhD,” or something like that!

In the meantime, feel free to read some of the following related articles:

Likewise, you can take your support a step further by heading over to my list of ways to grow the site. Otherwise, take care!

What Are Special Methods in Python?

Jeremy Grifski — Fri, 05 Jul 2024 14:00:00 +0000

In growing the Python concept map, I thought I’d take today to cover the concept of special methods, as their called in the documentation. However, you may have heard them called magic methods or even dunder methods. In any case, that’s what we’ll be covering today!

Concept Overview
Why Do Special Methods Exist?
What Special Methods Should I Care About?
Learning About Special Syntax

Concept Overview

If you’ve been playing with Python for any length of time, there’s a good chance you’ve come across a few methods with some strange syntax. After all, the standard for methods (and naming broadly) is snake case, where lowercase words are separated by underscores (e.g., my_function(), my_variable, my_method()).

However, there are a few methods, which we call special methods (or sometimes magic methods or dunder methods), that don’t obey that convention. Instead, they start and end with pairs of underscores. For example, you might have made an object of your own and been tasked with implementing the __init__() method.

As it turns out, there are several of these special methods that each serve their own purposes:

__new__()
__init__()
__del__()
__repr__()
__str__()
__bytes__()
__format__()
__hash__()
__bool__()

By no means is this list exhaustive. In fact, there are even a few for when you want to define the relational operators (e.g., <, >, etc.) or arithmetic operators (e.g., +, -, etc.). Likewise, if you want to customize your objects even more, there are special methods like __iter__(), __contains__(), and __len__().

Ultimately, however, you probably have some questions about these special methods. In the rest of this article, I’ll try to answer them.

Why Do Special Methods Exist?

As you are probably already aware, Python has a variety of built-in functions, which provide convenience for many of the built-in data types like strings and lists. For example, we can use the built-in len() function to get the length of a list. Similarly, we can use the built-in str() function to get a string representation of that same list.

As it turns out, Python also has a lot of operators to allow for a variety of convenience features on those same pieces of data. For example, you can concatenate strings with the + operator. You can also add together numbers with the same operator.

What makes this all possible are the special methods. See, it would be somewhat ridiculous to expect the built-in functions and operators to be able to work on any data type. It’s not like the underlying code for the + operator has a special case for numbers, lists, and strings. Instead, each data type defines how that operator will behave through the corresponding special methods.

Just for the sake of argument, imagine you’re designing the list class in Python. You might write something like this to enable concatenation with the + operator:

class list():
  def __add__(self, other):
    self.extend(other)

Then, when the + operator is used between two lists, this method is called and the first list is extended by the other. Of course, this isn’t how the actual + operator works for lists, but you could see how you might go about implementing the feature yourself.

Seeing __add__() in action, you might be wondering which special methods are even important. We’ll cover that question next.

What Special Methods Should I Care About?

The short answer to this question is to think about which built-in functions and operators you use most often and prioritize understanding them. However, here are some of my recommendations.

Constructing Objects With init()

By far the most important special method to know about is __init__() because it serves as the constructor. Here, you can define as many parameters as you’d like for the purpose of instantiating an object. For example, if you want to create a Cat class, you might give the user the option to specify a few features of the cat such as age, color, and name.

class Cat():
  def __init__(self, age, color, name):
    self.age = age
    self.color = color
    self.name = name

It’s also okay to leave the constructor as-is. In that case, you will probably want to set your variables to some default value. Then, you can let the user control them through methods. That said, a constructor is usually a bit more convenient for your users.

Note that there are no built-in functions or operators that activate the __init__() method. Instead, this method is called when you create an instance of the Cat class (e.g., Cat(7, "gray", "Reina")).

Converting Objects to Strings With str() and repr()

Perhaps my favorite pair of built-in functions are str() and repr() because they allow us to convert any data type we want to a string. Of course, none of this is possible without having the corresponding special methods defined in our objects. As a result, if we want str() and repr() to return something useful, we need to implement __str__() and __repr__().

I’ve written about the differences between these two functions somewhat extensively, but to summarize: str() should return a client-facing string while repr() should return a developer-facing string. For example, if you had an object for rolling dice, you might want str() to return the total of the current roll (e.g., "8") while repr() returns the underlying values like the number of sides on each die as well as their current state (e.g., "Dice(sides=[6, 6, 6], state=[3, 4, 1])").

Comparing Two Objects With eq()

While you can implement all of the relational operators for every class, the one relational operator I recommend implementing is ==, which is computed through __eq__(). The reason is somewhat obviously: if you ever want to verify that two objects are equal, you must implement __eq__(). Otherwise, Python uses the is operator, which only compares object IDs.

One thing to note is that you’re not required to implement != through __ne__(). Fortunately, __ne__() defers to __eq__(). If you plan to implement the other relational operators, __eq__() has no effect.

Learning About Special Syntax

Overall, Python has a lot of special methods. I didn’t count them all, but you might just look at the list of numeric special methods. This list contains methods for just about every math operation you might want to do from basic arithmetic to bitwise computation. In other words, you could really get lost in the weeds learning about the special methods—which are a set of features I’ve even considered abusing for fun.

With that said, I think this will have to do for today! If you liked this, I always recommend that you keep browsing as it helps the site grow:

Likewise, if you want some more formal resources for learning Python, check out these books (#ad):

Finally, you can take your support even further by checking out my list of ways to grow the site, which includes links to my Patreon, Newsletter, and YouTube channel.

What Is Iterable Unpacking in Python?

Jeremy Grifski — Fri, 28 Jun 2024 14:00:00 +0000

Today, we’re expanding our new concept series by building on the concept of an iterable. Specifically, we’ll be looking at a feature of iterables called iterable unpacking. Let’s get into it!

Concept Overview
Common Use Cases
How Does Iterable Unpacking Work With Custom Iterables?
Python Version History
Unpacking More Concepts

Concept Overview

In Python, iterable unpacking refers to the ability to extract the contents of an iterable into a sequence of variables.

To illustrate iterable unpacking, imagine that you have a list where the elements are in a predictable order, such as a list containing a student’s ID number as well as their first and last name. When processing this list, we might want to move each element into a variable with a reasonable name, rather than accessing the indices directly, as follows:

student = [3, "Jeremy", "Grifski"]
identifier = student[0]
first_name = student[1]
last_name = student[2]

Naturally, this is a bit cumbersome, so the Python developers came up with a much nicer syntax called iterable unpacking:

student = [3, "Jeremy", "Grifski"]
identifier, first_name, last_name = student

Later, if you decide to expand your dataset to include other information about each student, you might convince yourself that the only solution is to fall back on indexing and slicing:

student = [3, "Jeremy", "Grifski", 175, 3.2]
identifier, first_name, last_name, data = student[0], student[1], student[2], student[3:]

Fortunately, you can save yourself a lot of time by using the iterable unpacking operator, *, as follows:

student = [3, "Jeremy", "Grifski", 175, 3.2]
identifier, first_name, last_name, *data = student

In other words, aside from the first three elements in the list, everything else will be stored in a new list in data.

Next, we’ll talk about a few places where iterable unpacking is commonly used.

Common Use Cases

The very first place that iterable unpacking comes to mind for me is in the *args syntax you sometimes see as function parameter, such as in max() and min(). The whole idea being that you can pass in any number of values and they all get captured in the args variable, much like the following:

identifier, first_name, last_name, *args = 3, "Jeremy", "Grifski", 175, 3.2

Another place that this comes to mind for me is when you want a convenient way to swap variables. Normally, you have to introduce a temporary variable as follows:

first = "Robert"
second = "Allen"

# performs the swap
temp = first
first = second
second = temp

With iterable unpacking, the syntax could not be cleaner:

first = "Robert"
second = "Allen"

# performs the swap
first, second = second, first

In addition, another really common use of iterable unpacking is during looping. For example, sometimes folks like to approximate a for loop from their favorite C-style languages by using enumerate(). What enumerate() does is craft a sequence of tuples where the first element is the index and the second element is the data. Because the order is constant, we can take advantage of iterable unpacking:

watchlist = ["Loki", "Attack on Titan", "Vinland Saga"]
for index, show in enumerate(watchlist):
  print(index, show)

# prints
# 0 Loki
# 1 Attack on Titan
# 2 Vinland Saga

A similar trick can be used when you’re iterating over a dictionary using the items() method. Once you start to notice this trick, you’ll start seeing it everywhere.

Lastly, you might also know that Python let’s you return multiple values from a function by wrapping them in a tuple. When this happens, you might be tempted to store the tuple outright and index it for the values you need. Instead, I prefer to use iterable unpacking to get nice variables as follows

def grade_range(scores):
  return min(scores), max(scores)

grades = [71, 93, 54, 99, 80]
worst_grade, best_grade = grade_range(grades)

How cool is that? Next, we’ll look at how iterable unpacking works with custom iterables.

How Does Iterable Unpacking Work With Custom Iterables?

Previously, you may recall that we created a couple of our own custom iterables. Naturally, I’m interested in if these same iterables support iterable unpacking. To start, we’ll use a custom iterable that implements the __iter__() special method.

import random

class DiceRoll:

    def __init__(self):
        self.history = []

    def roll(self):
        total = random.randint(1,6) + random.randint(1,6)
        self.history.append(total)
        return total

    def __iter__(self):
        return iter(self.history)

After rolling the dice a few times, I attempted iterable unpacking, and it just worked:

test = DiceRoll()

test.roll() # returned 5
test.roll() # returned 11
test.roll() # returned 10
test.roll() # returned 2
test.roll() # returned 8

first, *middle, last = test # unpacked into 5, [11, 10, 2], 8

Out of curiosity, I also attempted it on the alternate implementation using __getitem__():

import random

class DiceRoll:

    def __init__(self):
        self.history = []

    def roll(self):
        total = random.randint(1,6) + random.randint(1,6)
        self.history.append(total)
        return total

    def __getitem__(self, key):
        return self.history[key]

Perhaps unsurprisingly, it worked just as well:

test = DiceRoll()

test.roll() # returned 6
test.roll() # returned 7
test.roll() # returned 4
test.roll() # returned 7
test.roll() # returned 6

first, *middle, last = test # stored 6, [7, 4, 7], and 6

Overall, I was extremely pleased with how robust the iterable and iterable unpacking features are in Python.

Up next, we’ll take a look at a brief history of the iterable unpacking feature.

Python Version History

It’s important to note that iterable unpacking has not been a permanent feature of Python, and it’s been modified over time. For example, the following lists some of the changes made to iterable unpacking since Python 3:

Python 3.0: PEP 3132—Extended Iterable Unpacking
Python 3.5: PEP 448—Additional Unpacking Generalizations

Unfortunately, prior to Python 3, iterable unpacking didn’t really exist to the extent it does now. Instead, folks were using a mix of indexing and slicing to manually unpack their iterables.

Unpacking More Concepts

Previously, we explored the idea of iterables, and today we looked at a feature of iterables called iterable unpacking. In the future, we’ll look to cover some of the related concepts that popped up today like slicing and indexing. Overall, I’m really enjoying watching this concept map grow! I hope you are too.

In the meantime, you’re welcome to browse some of these related articles:

Similarly, you can start learning Python with any of the following resources (#ad):

Finally, you can support the site by heading over to my list of ways to grow the site. Otherwise, take care!

What Is an Iterable in Python?

Jeremy Grifski — Fri, 21 Jun 2024 14:00:00 +0000

I’m kicking off a new Python series that covers core programming concepts. This is a bit different from my usual problem solving content, so it should be a lot of fun! Today, we’re kicking off the series with an explanation of what an iterable actually is in Python.

Concept Overview
Examples of Common Iterable Data Types
Built-in Functions That Accept Iterables
Making a Custom Iterable
- Creating an Iterable Using __iter__()
- Creating an Iterable Using __getitem__()
Why Do Both Strategies Work?
Iterables Are Only the Beginning

Concept Overview

Broadly, an iterable is a type of data structure that can be looped over. I say broadly because I don’t believe Python has a strong definition for iterable. The closest you will get is from the glossary, which defines an iterable as:

An object capable of returning its members one at a time. Examples of iterables include all sequence types (such as list, str, and tuple) and some non-sequence types like dict, file objects, and objects of any classes you define with an __iter__() method or with a __getitem__() method that implements sequence semantics.

Iterables can be used in a for loop and in many other places where a sequence is needed (zip(), map(), …). When an iterable object is passed as an argument to the built-in function iter(), it returns an iterator for the object. This iterator is good for one pass over the set of values. When using iterables, it is usually not necessary to call iter() or deal with iterator objects yourself. The for statement does that automatically for you, creating a temporary unnamed variable to hold the iterator for the duration of the loop. See also iterator, sequence, and generator.
Source: Python 3 Glossary

To me, the definition above is a bit more descriptive than prescriptive. In other words, it sort of tells us what kinds of data structures would count as iterables, but it doesn’t give us any strict rules for creating one. As a result, there aren’t really any strong claims I can make about what you can do with an iterable because there are different kinds of iterables.

If you’ve used any other mainstream programming languages (e.g., Java), you might find the lack of a strong definition to be a bit jarring. After all, an iterable would typically be a data structure (i.e., an object) which inherits iterable properties from something like an interface. Therefore, you would know if a data structure was iterable through a type check (e.g., is this data structure an instance of “iterable”?).

In contrast, as with many things in Python, you sort of have test an object for its limits. For instance, if you want to know if an object is iterable, you have to call iter() on it. Alternatively, the more Pythonic approach might be to just try iterating over it using a for loop, which the Python docs call EAFP:

Easier to ask for forgiveness than permission. This common Python coding style assumes the existence of valid keys or attributes and catches exceptions if the assumption proves false. This clean and fast style is characterized by the presence of many try and except statements. The technique contrasts with the LBYL style common to many other languages such as C.
Source: Python 3 Glossary

Ultimately, we’ll take a look at the definition provided in more detail and try to explore iterables of various types.

Examples of Common Iterable Data Types

Despite the lack of a solid definition for iterables, most of the data structures you encounter will be iterable to some extent. Perhaps the most obvious example is the list, which you can loop over easily:

dice_rolls = [7, 6, 7, 10, 8, 7]
for roll in dice_rolls:
  print(roll)

# prints:
# 7
# 6
# 7
# 10
# 8
# 7

You can also loop over a string to look at individual characters:

dna_sequence = "CTCTAAA"
for base in dna_sequence:
  print(base)

# prints: 
# C
# T
# C
# T
# A
# A
# A

Unfortunately, things start to get a little tricky with a dictionary. If you try to iterate over them directly, you’ll only see the keys:

grades = {"math": "B-", "art": "C", "science": "A+"}
for subject in grades:
    print(subject)

# prints:
# math
# art
# science

If you want to iterate over the actual pairs (as tuples), you have to make use of the items() method:

for pair in grades.items():
    print(pair)

# prints:
# ('math', 'B-')
# ('art', 'C')
# ('science', 'A+')

More often than not, however, you’re going to want to take advantage of iterable unpacking anyway:

for subject, grade in grades.items():
    print(subject, grade)

# prints:
# math B-
# art C
# science A+

In any case, you can probably imagine that most built-in data types are iterable including strings, lists, tuples, and dictionaries. Up next, we’ll look at some built-in functions that take advantage of this trait.

Built-in Functions That Accept Iterables

Some of the very first functions that come to mind for me when I think of iterables are min() and max(). These methods take an iterable and return the smallest and biggest items in them, respectively. Here’s what that looks like on a list:

dice_rolls = [7, 6, 7, 10, 8, 7]

min(dice_rolls) # returns 6
max(dice_rolls) # returns 10

Then, there are functions like zip(), which take a series of iterables and combine them. Of course, zip() is a lazy function, so we have to convert the result back to a list to see our items:

subjects = ["math", "art", "science"]
grades = ["B-", "C", "A+"]

list(zip(subjects, grades)) # returns [('math', 'B-'), ('art', 'C'), ('science', 'A+')]

There are even interesting boolean functions like all() which can be used to determine if everything in an iterable is truthy:

truthy_values = [5, "hello", True]
all(truthy_values) # returns True

If you get bored, I’d recommend browsing the built-in functions list. There are several more functions that take iterables, such as map(), any(), and enumerate().

Making a Custom Iterable

At this point, we have a rough idea of what an iterable is in Python. We also know of a few examples of common iterables such as lists and strings, and we know a few built-in functions that take iterables. Now, I want to take a moment to craft a couple iterable objects by following the definition provided in the Python docs and play with them a bit (i.e., use them in loops, pass them to built-in functions, etc.).

Creating an Iterable Using `iter()`

Keeping with the dice roll theme, I made a simple object that let’s you roll a pair of dice and stores every roll in your history:

import random

class DiceRoll:

    def __init__(self):
        self.history = []

    def roll(self):
        total = random.randint(1,6) + random.randint(1,6)
        self.history.append(total)
        return total

    def __iter__(self):
        return iter(self.history)

To make make this object iterable, you’ll notice that I have implemented the __iter__() special method. The idea being that the underlying list is iterable, so I just return its iterator.

Now, we can roll the dice a few times and report our history:

test = DiceRoll()
test.roll() # returned 6
test.roll() # returned 11
test.roll() # returned 5
for roll in test:
    print(roll)

# printed:
# 6
# 11
# 5

What’s cool is that we can start taking advantage of its iterable status by passing it to a variety of built-in methods:

min(test) # returned 5
max(test) # returned 11
any(test) # returned True

Ultimately, it looks like we’ve created an iterable. Let’s try another way.

Creating an Iterable Using `getitem()`

According to the docs, we can also create an iterable by implementing the __getitem__() special method, which allows us to index our data structure. Fortunately, that’s an easy change:

import random

class DiceRoll:

    def __init__(self):
        self.history = []

    def roll(self):
        total = random.randint(1,6) + random.randint(1,6)
        self.history.append(total)
        return total

    def __getitem__(self, key):
        return self.history[key]

Instead of making use of list’s iterator, we make use of the ability to index a list. The results are virtually identical:

test = DiceRoll()

test.roll() # returned 10
test.roll() # returned 3
test.roll() # returned 4

for roll in test:
    print(roll)

# printed    
# 10
# 3
# 4

min(test) # returned 3
max(test) # returned 10
any(test) # returned True

So, it seemingly doesn’t make a difference which one you implement. Though, I imagine the context matters. For example, you might only implement __getitem__() if it made sense for you to be able to index your data structure (i.e., it’s a sequence-like data structure).

Why Do Both Strategies Work?

Previously, we looked at two different ways to create an iterable data type in Python. In the first example, we looked at an object which implements the __iter__() special method. In contrast, the second example creates an iterable using the __getitem__() special method.

While I didn’t go into detail around each of these special methods, they work quite differently. One relies on a separate one-time-use iterator object, and the other relies on sequence semantics. This begs the question: how are both supported so seamlessly? I was genuinely curious about this, so I dug into the docs a bit.

First, the reason the iterator solution works is somewhat obvious. The concept of an iterator is not new; many programming languages support them. In short, they’re an object with a “next” method which you use to get the next item in the collection. Some languages have the additional requirement of a boolean method to check if there is anything next, but as previously mentioned, Python subscribes to the EAFP principle. As a result, “next” will eventually crash, and that’s when we know to stop looping.

In contrast, when you implement indexing instead of an iterator, the iterable then assumes a counting mechanism for iterating over the data structure. Normally, you might need the length of the data structure to know when to stop, but again we don’t really want to be checking things first. Instead, the loop will count up until it gets an IndexError, at which point the loop terminates.

While the logic is a bit different in both cases, the design is fundamentally the same: we iterate until we encounter an error.

Iterables Are Only the Beginning

With this being a new series, I’m expecting to build on existing concepts. For example, while we now know what an iterable is, we’ll need to explore related terms like iterable unpacking and indexing. Because concepts are all very much connected, I’m sure there will be a lot of unpacking (pun absolutely intended) going forward.

Until then, however, why not explore some of my other articles:

Alternatively, here are some resources for helping you learn Python (#ad):

Finally, you can take your support even further by helping me grow the site. Otherwise, take care!

Meritocracy: The Facade That Determines Who Deserves Success

Jeremy Grifski — Fri, 14 Jun 2024 14:00:00 +0000

On a drive home from a bachelor party, I was so bored that I got to thinking about meritocracy and its consequences. It’s amazing what the mind can do.

Who Deserves Success?
Who Doesn’t Deserve Success?
Making Sense of the Facade

Who Deserves Success?

Meritocracy has a lot of different definitions, but in the United States it is probably best defined as follows: a system in which individuals will succeed in life as long as they do some combination of hard work, talent development, and learning.

Under a belief in meritocracy, the question of “who deserves success?” springs forth. After all, if someone demonstrates merit, we would argue that they deserve success. In fact, we see this all the time in certain contexts.

One context that comes to mind for me is hockey because amazing players regularly fail to earn a cup. A couple players in recent memory who have never won a cup are Joe Pavelski, who just announced his retirement, and Henrik Lundqvist. Though, many would argue that they both deserved a cup.

Likewise, we often see this with rich people in general. A solid portion of society will look at people like Bezos and Musk and assume that they deserved their success. After all, under meritocracy, it’s not possible for someone to reach that level of success without having earned it. Confronting the reality of their success would quite literally crumble the belief system, so people don’t do it.

Overall, I find it interesting to look at examples of folks who succeed within the meritocratic framework as expected (e.g., CEOs) and look at how people use that to justify their belief system. I also find it interesting to look at folks who don’t succeed within the meritocratic framework who otherwise should (e.g., hockey players) and look at how people don’t update their belief system. The entire concept of people “deserving” things is really interesting!

Who Doesn’t Deserve Success?

Interestingly, for every example of an individual or group that we feel deserves success through the lens of meritocracy, there are examples of folks who we argue don’t deserve success.

Almost the entire inspiration for this article stems from a conversation I had with some friends at a bachelor party about unions. I started the conversation about unions, so I could tell the story about why my dad hates them. However, to preface the conversation, I stated that I’m very pro-union.

Having shared my story, it wasn’t long before a few of my friends were pitching arguments for why unions are bad. One very common argument that I always hear, which came up during this conversation, was that unions are bad because they enable lazy workers. This, of course, is subtly arguing that lazy people don’t deserve success—which is right in line with what we’d expect from a society built on a belief in meritocracy.

We see very similar takes in political discussions around policies intended to help folks. For example, the concept of “universal” anything—such as universal basic income, universal healthcare, or universal student loan forgiveness—is always met with pushback from both sides of the aisle.

When democrats talk about student loan forgiveness, they always want to put stipulations on it where rich people won’t get the benefit because “they don’t deserve it.” Deserve in this context is actually not related to meritocracy at all but rather related to need (i.e., it’s not that rich people haven’t merited forgiveness but rather they don’t need it.). Unfortunately, this often leads to beneficial policies not passing due to pushback from rich groups.

Meanwhile, republicans often argue that students haven’t worked hard enough to merit the forgiveness. Often times, the argument is around the trades being “real work” and desk jobs being the opposite. Ironically, there are a ton of working class folks who could benefit from student loan forgiveness, and they find themselves voting against their own interests for the sake of upholding some ideal form of meritocracy (e.g., hard workers don’t take handouts).

Overall, what we see with the comparison between folks who don’t deserve success and don’t get it versus folks who don’t deserve success but get it is exactly the same as previously discussed. Results that align with meritocracy are used to reinforce that belief system while results that don’t align are discarded.

Making Sense of the Facade

I don’t think anything I’ve written about hasn’t been explored by actual scholars (e.g., Erin Cech), but I got bored on my drive home from the bachelor party, and I had meritocracy on my mind. Ultimately, I encourage you to apply these same ideas around “who deserves something” in your own life to see where it crops up.

For instance, I personally think amazing hockey players deserve to win a cup, but I am also aware of the randomness that occurs in sports. There is so much out of the control of that individual player that it’s not like they failed; they just got unlucky.

With all that said, I’ll save some of my more “burn down the system” takes for another time. For now, I’m going to continue to think about this idea of “deserving” success and even “deserving” failure or punishment. In the meantime, I hope you’ll continue to browse the site! For instance, here are some other weird articles that have nothing to do with programming or education:

Hope that will keep you busy! If not, you can always head over to my list of ways to grow the site. Otherwise, take care.

6 Tips for New College and University Educators

Jeremy Grifski — Fri, 07 Jun 2024 14:00:00 +0000

As I wrap up my 5th year of teaching, I figured I’d put together a nice little set of tips for folks just starting to teach at the college-level. In general, my tips are very big picture and will help you get through some of your tougher times as an educator.

A List of Tips for New Educators
Bonus Tip: You Can Grow As an Educator

A List of Tips for New Educators

Since starting my teaching career in 2018, I’ve grown a lot. A major part of that growth has been learning how to deal with some challenging teaching situations. As a result, this list isn’t really going to give you the kinds of tips you might typically see—like how to organize your office or how to structure your free time. Instead, it’ll focus on the challenges you will face as an educator and how to prevent and/or overcome them.

You Should Always Get Details Down in Writing

One lesson I learned very early in my teaching career was to always get agreements down in writing—especially if you want to be a flexible educator for you students. In other words, if you make a deal with one of your students (e.g., by giving them an extension on an assignment), that deal should be written down somewhere like your email.

Hopefully, it’s somewhat obvious why this is important, but I’ll explain anyway. If you do not get a deal down in writing, it’s very easy for the student to ignore the agreement and submit work on their own terms. Previously, I talked about a student who did exactly this to me, and I had no way to call them out on it.

If there is nothing else you take from this list, it should be this tip. It was the first major lesson I learned, and I haven’t had any problems like it since.

You Can Prevent Most—But Not All—Cheating

As someone in the world of computer science, there’s a general obsession with objectivity. In other words, there is this belief that there is always a right and wrong answer to a question. Unfortunately, this leads to a lot of lazy assessments that ask students to answer questions with an objectively correct response—or at least a correct response according to the instructor.

Naturally, the consequence of giving assessments which only have one answer is that students will very quickly be able to retrieve the correct answer without doing the work (i.e., cheating). I’ve had instructors in the past try to solve this problem by changing their assessments every semester, but this is a lot of work for very little payoff, in my opinion. In my own experience going as far back as even my first semester of teaching, I’ve seen a lot of these kinds of assignments plagiarized

One solution to this cheating problem is to provide more open-ended assessments, such as open-ended projects. The point here being that students can then select a topic of their own interest and build their knowledge around it.

There are, of course, other strategies for preventing teaching, most of which involve just getting to know your students. For instance, knowing your students’ names is a great way to make students feel less anonymous.

Of course, even if you go through the effort of preventing cheating, you will still have some students that cheat. It is inevitable, and I do not see a reason to beat yourself up over it. Maybe just send them my tongue-in-cheek article on how to cheat in a better way.

You Should Take Your Teaching Evaluations With a Grain of Salt

As far as I know, many institutions have students fill out some form of teaching evaluation at the end of each semester. This is a great tool for you to help develop your teaching with some feedback directly from your students. However, you should be wary of the results for a few reasons.

First, students aren’t exactly equipped to evaluate teaching. They aren’t trained in it, so they don’t know what effective teaching should look like. However, they can tell when an environment is welcoming and supportive, so I would definitely listen to them when they describe their experiences.

Second, in my experience, teaching evaluations are very rarely used as a part of the promotion journey. If institutions aren’t going to respect them as a part of your performance reviews, then I think it’s safe to say that you shouldn’t stress too much about your teaching evaluations either.

Finally, and perhaps most importantly, teaching evaluations are biased to folks that look like me: a white male. This reality has been reproduced multiple times in the literature, so you may put a ton of work in your teaching to still receive bad reviews from your students. This is partially why I think we should throw them out as a form of feedback.

You Should Find Ways to Align Expectations

As an educator, you will find yourself in situations where you communicated your expectations poorly. In other words, students will think they need to do one thing when you had something else entirely in mind. Unfortunately, this only becomes more complex when you introduce additional stakeholders, such as graders or other instructors.

For me, I had a lot of trouble aligning expectations between my students, their graders, and me. Often times, I would say one thing to a student and a grader would say another. Naturally, this led to all sorts of problems. Specifically, I recall having a grader who was significantly tougher on students than I would have liked, which lead to many students losing motivation.

Ultimately, I ended up creating rubrics for all of the assignments. That way, graders were bound to the structure of the rubric, and students could get a feel for what was expected of them. Sure, there were still disagreements between the three groups, but the difference was usually a point or two.

You Do Not Have to Stress About Your Choices

In general, teaching is a tough field. You have to deal with a lot of challenges coming from a lot of different directions, like students, parents, and administrators. Not to mention, you will be presented with situations that don’t have a correct answer.

One scenario that came to mind for me was a student who would routinely show up to exams late, expecting to be able to get the full time needed. In a scenario like this, there are a lot of ways to approach the problem. You could tell the student, “tough luck.” If possible, you might stay late to give them extra time. Alternatively, you might offer them a retake exam at a later date.

Regardless, when you’re presented with a scenario where there is no clear way to proceed, it’s okay not to beat yourself up over the path you choose. In my case, I went the “tough luck” route, consulted some colleagues, and walked my decision back later. It’s okay to do that.

You Can Chalk a Course or Semester Up as a Loss

Aside from my writing tip, I would say that my second most important tip would be that it’s okay to realize you’ve done all you can do. I first had this realization last semester at the end of my fifth year of teaching when I found that I just couldn’t engage one of my classes.

One thing you will have to come to terms with is that what you do as an educator will not work for every single student all of the time. Sure, you might try to leverage a diverse set of teaching techniques, but there comes a point when it’s okay to stop trying. I reached that point with one of my classes last semester.

Thankfully, semesters end, so you always have that to look forward to. You will get a chance to try again next semester.

Bonus Tip: You Can Grow As an Educator

Many of the stories I drew on to create this article came from old reflections I wrote over the past several years. Here are just a few of them:

If you want to get better as an educator yourself, I’d recommend writing your own reflections. They can help you process a semester, and you can look back on them to see how much you’ve grown.

Alternatively, you can follow my educator journey by reading one of the articles above or exploring my list of ways to grow the site. Otherwise, take care and thanks for reading!

Checking Up on Google Search in 2024: It’s Worse Somehow

Jeremy Grifski — Fri, 31 May 2024 14:00:00 +0000

Recently, I was ranting about Google’s new AI Overview feature when I stumbled upon an old article of mine from 2020. Back then, I investigated keywords for some of the articles I’d written to scope out the competition. What I found was that Google wasn’t even trying to rank decent computer science content. Since my site hasn’t really grown since then, I decided to check the same keywords to see what’s going on! Spoiler alert: it’s worse somehow.

Following Up On Old Gripes With Google Search
Reviewing “How to Check If a List is Empty in Python” Again
Reviewing “Python Code Snippets” Again
Contextualizing the Problem
Key Takeaways

Following Up On Old Gripes With Google Search

As you can imagine, I reviewed the article from 2020, and I’ve realized that there are problems on Google that persist to this day. The bulk of the rest of this article will show you what I mean, but I want to share just a couple up front.

Google Ranks Duplicate Content Separately

One of the issues I’d like to bring up in particular is something I noted in the syndication section. Specifically, I talked about how if you post an article in more than one place, you should add a canonical tag back to the original, so Google won’t rank them both.

Well, you may be surprised to know that both articles for the query “how to invert a dictionary in python” are still ranked four years later. Of course, instead of them being ranked 1st and 3rd, they’re ranked 4th and 11th, respectively. Just put the main one at 4th! What the hell is Google doing?

To give Google credit, the two articles are different in the sense that one platform allows for comments and the other doesn’t. So, my guess is that the comments coupled with the domain authority of the other site has caused it to continue to outrank the original. I do think it’s stupid regardless, and I wish I could add a no-index tag to the syndicated version.

Anti-intellectual Content Is Multiplying

Also, as you’ll see shortly, I ran a search again for the “how to check if a list is empty in python” keyword, and I found that many of the same complaints I made four years ago are valid today. For instance, I said this:

In general, I wouldn’t say any of this content is bad, but it’s definitely not great. For example, one of the articles claims that you can check if a list is empty by the “not operator.” This isn’t wrong, necessarily, but it doesn’t really explain why it works.

Looking at the articles now—and there are literally dozens—the use of the phrase “not operator” is even more common. To me, that’s a bit of a problem because it doesn’t explain why the solution works. I suppose if you’re just looking to copy code you might not care, but the “not operator” here isn’t actually doing the work. I know it’s pedantic, but as someone who takes care to write educational content, it’s a bit troubling to see these types of explanations (or lack thereof) littering the search index.

Google Does Not Rank the Canonical Article

And since we’re here, I just need to rant about Tutorials Point again. I trash talked them pretty hard in the last article for writing thin content and stealing the top rankings on what I can only assume at this point is domain authority. After revisiting some of their old work, I was almost tempted into walking back my complaints. That was until I searched up the “python code snippets” keyword for my article titled 100 Python Code Snippets for Everyday Problems, which I published on December 27th 2019. Unsurprisingly, I found our old friends, Tutorials Point, stealing my article title practically verbatim.

Part of me thought maybe it was just a common title, and I was getting a bit upset over nothing. But then I remembered, I have receipts. Not a single article in the top 6 had a title like mine back in 2020. Yet somehow, Tutorials Point has an article spawn in 2023 with almost the exact title of mine with literally no history of it in the web archive.

Imitation Is the Sincerest Form of Flattery

But wait, it gets even more ridiculous because Tutorials Point isn’t even the only offender. GeeksForGeeks also has an article with the exact same title as Tutorials Point but published almost two years earlier in 2021. And before long, I started to notice a variety of articles with variations of my title:

19 Python Code Snippets for Everyday Issues (2023-09-26)
10 Python Code Snippets for Everyday Programming Problems (2021-12-01)
11 Python Boilerplate Code Snippets Every Developer Needs (2023-01-12)
10 Most Popular Python Code Snippets Every Developer Should Know (2023-09-03)
14 Code Snippets That Every Python Programmer Must Learn (2023-01-05)

Meanwhile, my article isn’t even listed until rank 27. Even worse, we run into the syndication problem again where the old version of the article is the one Google has chosen to index. In other words, what you’ll see in search is “71 Python Code Snippets for Everyday Problems” on dev.to. I am amazed Google still manages to mess this up.

With that rant out of the way, let’s take a slightly deeper dive into the pair of queries we explored last time.

Reviewing “How to Check If a List is Empty in Python” Again

One of the articles I wrote back in 2018 was called How to Check If a List is Empty in Python. At the time in 2020, it was ranked second for the search phrase, “how to check if a list is empty in python.” Today, it’s ranked 72nd. Let’s take a look at the competition:

Unlike in that past, you can see there is no snippet for this search term anymore. Instead, Google actually looks somewhat like what you’d expect: a list of search results. Unfortunately, the results aren’t great.

Rank 1: Stack Overflow

At the top is the same site I attempted to dethrone many years ago, StackOverflow. For the record, the content is largely the same:

I can’t really be too upset about this because Stack Overflow does tend to have good information on it. However, I still wish it wasn’t so popular, as the culture of the platform is very toxic.

Rank 2: freeCodeCamp

At rank 2 is a relatively new article from an independent author on freeCodeCamp. The content itself is nearly identical to my article in terms of solutions provided, albeit in a different order, with different wording, and with different examples. It’s also significantly more streamlined, which I can understand if all you want is a solution and no explanation.

The only thing that is kind of weird is the title: “Python isEmpty() equivalent – How to Check if a List is Empty in Python.” Nowhere in the article is “isEmpty()” ever mentioned. I would expect with a title like this that the content would be trying to get you to connect your current knowledge to some new knowledge, but the article never makes that connection explicit.

Overall, though, I can’t really complain. The article seems fine.

Rank 3: GeeksForGeeks

At rank 3 is our old friend GeeksForGeeks, which I can’t stand. Last time, I think they were at rank 4, so they somehow gained a position. I don’t recall what the page used to look like, though I suppose I could check the web archive. That said, I was somewhat pleased with the content this time around. There’s at least an example of the problem and a host of solutions that go far beyond the three I listed (though, some of them are truly absurd). The code samples are also pretty strange, with no regard for naming conventions or software design principles. Just take a look at this:

def Enquiry(lis1): 
	if not lis1: 
		return 1
	else: 
		return 0

# Driver Code 
lis1 = [] 
if Enquiry(lis1): 
	print("The list is Empty") 
else: 
	print("The list is not empty")

What is the point of the Enquiry function? Why make use of type flexibility with integers to show type flexibility of lists? I’m so confused. With the way content always seems to look on GeeksForGeeks, I would not be surprised if this was generated by ChatGPT.

Rank 4+: Everyone Else

Anyway, following rank 3, there’s actually a huge gap in the search results. At that point, Google provides a feature titled “people also ask,” which is quite fun to explore if you’re bored. Then, under that is a small section titled “discussions and forums” listing off Stack Overflow again, alongside Reddit and Quora.

From there, there are a series of new sites plus the other previous suspects:

Programiz (173 words): shows the same three solutions
Flexiple (454 words): shows five solutions but three of them are duplicates
Tutorials Point (974 words): I previously complained about this one, but it’s much more thorough now
IOFLOOD (1994 words): a bit more interesting and covers more cases

As I complained previously, most of these articles are quite short and are more or less copies of each other. Worse still, a lot of them look like the they straight up ripped my code. Of course, that’s hard to prove, but many of them use the same variable names and even the same comments, though sometimes as print strings.

With that said, I am slightly more happy with the pages I trashed in the past. It seems that there’s just a lot more garbage now.

Also, I’ll say that I went back and reviewed my article and noticed some artifacts in the text from when I used to use a different inline highlighting plugin (e.g., text that looked like this: `run()`python). But if that’s the only reason I’m sitting at rank 72, that’s pretty absurd. I suppose I’ll check back in a few weeks and see if anything has changed.

Reviewing “Python Code Snippets” Again

When I think about my 100 Python Code Snippets for Everyday Problems article, I think of it as one of the few examples of what I’d call an “ethical listicle.” Usually, at least in my experience, listicles are low quality bait posts that were popularized by sites like BuzzFeed. The whole premise is that they’re meant to be relatable but ultimately clickbait, so you can get that sweet, sweet ad revenue.

In contrast, my article doesn’t just list a bunch of code snippets but actually backs each of them with another article which describes them in more detail. The content itself is organized by data structure, so the snippets themselves are organized in groups. I don’t think you can really make a list post more informative.

To me, that should put my article near the top in terms of rank, but that’s unfortunately not what I’ve found. Instead, it’s sitting at rank 27. Here’s the competition:

At a glance, I don’t really see any of the old articles ranking in the top anymore—not just mine. When comparing the new titles with the old title, I would argue that the results are roughly the same (i.e., a mix of list posts). So, let’s see how the content differs!

Rank 1: BuiltIn

Up top, we have an article titled “13 Python Code Snippets You Need to Know,” which seems to have come on the seen on March 8th 2022. With a quick peek, I notice that the structure of the article is pretty similar to mine in that it organizes the solutions by data structure (e.g., lists, dicts, strings, etc.).

Out of curiosity, I tried comparing my article to theirs with a plagiarism detection tool and found that there was really only 3.6% plagiarism. However, the sections that were plagiarized were a bit egregious. For example, the code snippet I use to show how to combine two lists into a dictionary is almost copied verbatim:

And, those are the paraphrased solutions. There are several that are ripped directly. For example, two of my dictionary inversion solutions are ripped verbatim:

Out of curiosity, I thought to even search for the variable name “my_inverted_dict,” since I figured it was unique enough. And sure enough, three of my articles show up from as early as December 4th, 2017. In hilarious fashion, my solution is used in a Stack Overflow answer almost two years later. It’s also copied in a bunch of other places, which caused me to stop looking lest I get very sad.

But wait! There’s more. My personal favorite stolen set of code snippets comes from my goofy search engine implementation (meta, I know) where the author just straight up copied the example list:

I’m not even certain they understood what they copied because it’s very easy to check if a string is in a list, like they claim they’re doing. What these solutions do is check if a substring exists within a string in a list. I also find it funny that the plagiarism tool I used didn’t quite catch how much of the two code sets were actually identical, so I assume the 3.6% is a conservative estimate.

If you’re bored, I’d recommend comparing the two articles. There are some other funny copies. For instance, my entire parsing spreadsheet solution is copied.

This did, of course, lead me down a rabbit hole of trying to detect plagiarism on articles on my site. According to Neil’s post, you can do this easily by dropping your article URL into copyscape. If you do that with my article, look who shows up:

So, the top ranking result on Google for the keyword “python code snippets” is an article that directly plagiarizes from my article, which ranks at an abysmal 27th. I have to wonder how much more of their article is plagiarized.

Rank 2: Squash Labs

At a glance, the squash labs article looks fine. It’s definitely one of the longer articles, and it has a nice navigational sidebar. However, it did not take me long to discover that the code snippets and explanations had bugs.

For example, the very first solution claims that the max() function can be used to find the max item in a list. This is true, though “max” probably needs to be defined with respect to whatever data type you’re using—which they cover, kinda. Specifically, they show two code snippets: one for integers and one for strings. However, the string solution is just wrong. Take a look:

fruits = ['apple', 'banana', 'cherry', 'date']
max_fruit = max(fruits)
print(max_fruit)  # Output: cherry

In the solution above (if you haven’t caught it yet), they claim that “cherry” is the last word alphabetically in the list, and it just isn’t. max_fruit stores “date” because “d” comes after “c” alphabetically (which is actually somewhat irrelevant anyway but not the point).

Now, in general, I’ve received complaints for having bad code in my solutions, and it’s mostly due to laziness. I get a solution working, make some minor changes, and copy over the lines without testing everything again. It happens. So, I will cut them a break, especially since I didn’t notice any other bugs in their solutions.

I will say that they make a lot of claims in their writing without any sources or justifications. Again, I don’t think that matters, but saying something like “membership tests for sets are faster than for lists” should probably be substantiated or demonstrated. It’s part of the reason why my articles include performance tests. Personally, I don’t believe their claim if the first step involves converting the list to a set just to do the membership test.

At any rate, before moving on, I did try to do some plagiarism detection. I kind of figured there wouldn’t be any, and there wasn’t—at least when compared to my site directly. However, copyscape connected their article to a variety of sources, suggesting to me that maybe the article was written with the help of generative AI. I have a hard time believing somebody would go through this much effort to integrate so many sources, but I suppose if James Somerton can do it…

Because my brain is rotted, I did try passing some of the article through an AI detector. I don’t totally trust AI tools for detecting generative AI, but every single model came back as likely AI:

Whereas, my article, which was written before these tools became mainstream, comes back as “appearing human.”

Last thing I’ll comment on is the weird URL. While I was going down the rabbit hole, I noticed the URL reads “25 handy python code snippets for everyday issues” while the article is titled “19 python code snippets for everyday issues.” That’s weird, isn’t it?

Rank 3: GeeksForGeeks

In third place for this keyword as well is our old friend, GeeksForGeeks. Again, there’s not much to say. I skimmed their article, and it looked pretty decent. None of the solutions seemed wrong, nor were they weird like the other article. I even did the same plagiarism checking, and I didn’t really find anything. So overall, I was fairly pleased with it.

In general, I don’t really like GeeksForGeeks. They have a content management problem, where not everything is the same quality. That said, I don’t have much to complain about here. I would be happy with it ranking above the other two articles so far.

Rank 4+: Everyone Else

If you keep going down the list, you’ll see a lot of the same kinds of content plus some weird stuff, like the topic page for Python Snippets on GitHub. It seems weird to index that given it’s basically an index page, but Google doesn’t seem to mind. There’s also some advertising-style content in the “Pieces for Developers” link.

Later down the list, there are your usual Q&A style links, like a Reddit thread on how to save code snippets, and a User Support Hub thread on how to use code snippets in Jupyter Notebooks. Again, I’m not sure how those are more relevant than my article, but we let Google do its thing.

If you keep scrolling, you’ll start to see a few more list posts and then you get into some documentation. Then, Google sneaks in a few more ads, and suddenly the results are completely unrelated. That’s not a great user experience.

Contextualizing the Problem

In order to really claim that Google sucks at ranking good content, it’s important not just to look at how it’s changed over time. It’s also important to look at how the results compare to other search engines.

There are many search engines to choose from, but by far the one that drives the most traffic for me beyond Google is DuckDuckGo. As a result, I decided to take a peek at the search results for the same keywords. What I found is almost comical.

To start, here are the results for the keyword “python code snippets” on DuckDuckGo:

What Google thinks is 27th place, DuckDuckGo places at the top of search. Sure, the plagiarized article is 3rd, but I’ll take that over outperforming the original.

As for the “How to Check If a List is Empty in Python” query, here’s what we get:

My article doesn’t show up until rank 15, but that’s miles better than 72. Otherwise, the results are pretty similar. Tutorials Point luckily doesn’t show up until after my article, so I’m happy about that.

I will also say that basically every result seems to be relevant, even if I don’t agree that the quality of the content matches the ranking. That alone makes me feel a lot better about it as a search engine.

Just for fun, I also looked at the same queries on other search engines, and here were my respective rankings:

Keyword	Google Rank	DuckDuckGo Rank	Bing Rank	Yahoo
Python Code Snippets	27	1	1	1
How to Check If a List is Empty in Python	72	15	15	13

In other words, on average, my articles just perform better on basically every other search engine. I’m not sure what that says about those other search engines, but it tells me that Google isn’t working for me. Maybe some day I’ll figure out why.

Key Takeaways

Having reviewed Google’s search results for a couple of my old articles, I got to thinking a bit more deeply about what exactly is wrong with Google. Here are a few of my key takeaways:

Google does not care about plagiarism: this one should be obvious. After all, I wrote a whole article about their new generative AI feature, which is essentially stealing with extra steps. Between that and plagiarized material outranking the original, it’s hard to know what to trust in the search results.
Google doesn’t handle content syndication well: I have reposted my articles in a few different places on the internet to try to expand my reach. Unfortunately, that has resulted in the copies outranking the originals in search.
Google doesn’t handle mimicry well: it seems common for content farm operations to see a popular article in search results and copy its title basically verbatim. I assume this is because the title conveys a lot of information related to the search phrase, but that just results in what looks like (and mostly is) a bunch of slop in the search results. Ultimately, it makes genuine content look bad in comparison, like we’re hopping on a trend or something.
Google does not care about accurate information: this is a criticism Google has gotten for a long time, especially with regards to medical advice. However, I think it’s probably an issue across the board. It did not take me long to find information that was just wrong.
Google does not care about relevance: this is a common complaint you’ll see people make now about how Google works. You’ll search for something, and you’ll get links to things that are tangentially related. So you end up iteratively refining your search, hoping to find anything of value. That’s definitely not ideal for a search engine whose primary purpose is to provide relevant results.

The last thing I’ll say is that I never want to turn this into some weird witch hunt where people go out and shame other writers, even if they plagiarize or use tools like ChatGPT. Instead, I want to be critical of the systems that incentivize this type of behavior. If Google was doing its due diligence as a search engine, people would not see the benefit of content farming, plagiarism, or generative AI as they would not produce desirable results on the search engine.

In general, I think we give these companies too much of a break when we run defense for them by saying things like: “it’s really hard to put together a good search algorithm to combat these issues.” Even sites that report on the horrible state of Google search seemingly place the blame on spam sites and not the search engine itself.

All of that is to say that I’ve started using DuckDuckGo. While I haven’t used it long enough to really have a preference, I can’t in good conscience continue using Google.

With that said, let’s call it a day here! I’ve probably spent too much time on this topic anyhow. As always, if you liked this article, there are more like it:

Likewise, you can take the extra step to support the site by checking out this list. Otherwise, take care!

Google Threatens to Ruin Search as We Know It

Jeremy Grifski — Fri, 24 May 2024 14:00:00 +0000

In my latest rant, I want to talk about Google’s absurd decision to add generative AI to its search engine.

Google’s Latest Search Feature
A Violation of the Social Contract
What Is the Long Term Plan?

Google’s Latest Search Feature

By now, you have most likely noticed the “experimental” AI Overview feature on Google. You type in your query, and you’re briefly greeted with a strange gradient and loading screen. Before long, a short answer to your question is generated by AI.

The new feature has lead to a surge in hilarious posts on Twitter showcasing just how bad the new feature is at giving good answers. As usual, I’ve curated some of my favorites, though I didn’t bother confirming if they’re real queries since Google doesn’t bother verifying their AI’s responses:

This is why they’re banning tiktok

@rare_cryptid on May 19th, 2024

perfect. ready to go. ship it out

@dril on May 5th, 2024

typing “1000km to [word]” is a surefire way to get the awful google AI to say something incredibly funny

@zachsilberberg on May 21st, 2024

Good ol’ Google AI: telling you to do the exact things you *are not supposed to do* when bitten by a rattlesnake. From mushrooms to snakebites, AI content is genuinely dangerous.

@ErinEARoss on May 19th, 2024

Yeah, you know, just whatever

@onionweigher on May 18th 2024

For as long as I’ve been alive, search engines have been a fundamental part of my experience with the internet. Without them, there is no way to access information without knowing the exact web address of the information you hope to find. It would be like going to a library where the books are stored in no particular order, each inside their own safe.

Thankfully, search engines were created to help us make sense of the information on the internet. They scour websites and put together massive indices which we can explore with our own keywords. Want to find out what other movies an actor is in? Just throw their name into a search engine alongside the word “movies.”

However, in order for search engines to exist, there needs to be information on the web to crawl. That information—I suppose until recently—doesn’t spawn out of thin air. It’s written and curated by real people with real experiences and expertise.

Most of that information is written entirely for free and usually costs the author money in terms domain and hosting services. In exchange for our labor, search engines promise that our work will be indexed for others to find—at least that’s the idea.

At this point, you’re probably wondering why search engines would be incentivized to exist. Sure, they improve the experience of the web for everyone, but what’s in it for them? Well, I’ll tell you: it’s data!

Since search engines aren’t subsidized by the state like libraries, they make their money by farming user data. Until fairly recently, that data was from a third party (i.e., users of the search engine). Every time you make a search or visit a site, the search engine builds up your profile, so they can target you with ads or sell your data.

For a while, this seemed like a fair arrangement: we get our work in front of people and search engines get their data. Which I suppose makes sense: we’re mostly okay with today’s lack of privacy given the existence of smart phones and social media.

But of course, the greed of the tech industry created new forms of exploitation. For instance, search became something to game and companies started creating content farms to target “the algorithm” through SEO. Since the top search result became so coveted, I would not be surprised if search engines themselves had content farms that they could prioritize in their algorithms. Why wouldn’t they? Hell, I’ve even wondered if Google is attempting to rank good content at all.

All of that is to say that the latest addition of the “AI Overview” is perhaps an unsurprising trend for the search engine. Except now, Google is threatening to exploit the only group who allowed them to exist in the first place: creators. I can only imagine what this means long term for search. As Baldur Bjarnason puts it, tech has broken the social contract and is offering us a much worse deal:

We and media companies put stuff up on the web for free. Some of us do it for business reasons. Some of it is personal.

Tech companies use this stuff to create systems that can make shoddy, degraded versions of our work, deepfake us, and make convincing fake only personas for astroturfing, destroying our work, businesses, and social interactions.

Writing when tech has broken the web’s social contract

What Is the Long Term Plan?

As someone who runs a tech and teaching blog, I’ve watched my pageviews drop pretty steadily over time—while I like to think my content has only gotten better.

As you can see, since 2017, I’ve had a million visits to my site. Looking back at my revenue, I’ve made around $2,000 in that time span. I’ve also written over 600 articles. That’s not exactly a great ratio of income to article, not to mention that it costs money to run the site.

Of course, as Baldur Bjarnson says, we don’t do this for the money. We do it for a lot of reasons. For instance, I write because I want more folks to have access to approachable educational content in computer science—as opposed to more toxic sources like StackOverflow or even Reddit. I cannot fulfil that mission if Google is going to start aggregating my work and presenting it as its own through generative AI. And no, providing links to the stolen material as “citations” is not going to solve this problem.

The part that kills me about all this is that it’s so short-sighted. What does Google do once people stop writing? What does Google do when the majority of the content online is generated by AI? All I see moving forward is a search engine that’s going to cannibalize itself. But, maybe they’ll change course under enough pressure.

With all that in mind, there’s never been a better time to help out writers like myself! I have a list of ways to do that, but the best way is probably the Patreon. If I could grow that, I’d be able to write a bit more than I do now. Otherwise, thanks again for reading!

The Renegade Coder

What Is Snake Case in Python?

Table of Contents

Concept Overview

When to Use Snake Case in Python?

Why Follow Snake Case in Python?

What Naming Conventions Do Other Languages Use?

Learning About Conventions

The Name 100 Women Challenge

All But Dissertation: The Light at the End of the Tunnel

Table of Contents

What Does It Mean to be All But Dissertation?

What’s Left to Do?

Did You Find Anything Interesting?

What Can We Do With This Knowledge?

What Are Special Methods in Python?

Table of Contents

Concept Overview

Why Do Special Methods Exist?

What Special Methods Should I Care About?

Constructing Objects With __init__()

Converting Objects to Strings With __str__() and __repr__()

Comparing Two Objects With __eq__()

Learning About Special Syntax

What Is Iterable Unpacking in Python?

Table of Contents

Concept Overview

Common Use Cases

How Does Iterable Unpacking Work With Custom Iterables?

Python Version History

Unpacking More Concepts

What Is an Iterable in Python?

Table of Contents

Concept Overview

Examples of Common Iterable Data Types

Built-in Functions That Accept Iterables

Making a Custom Iterable

Creating an Iterable Using __iter__()

Creating an Iterable Using __getitem__()

Why Do Both Strategies Work?

Iterables Are Only the Beginning

Meritocracy: The Facade That Determines Who Deserves Success

Table of Contents

Who Deserves Success?

Who Doesn’t Deserve Success?

Making Sense of the Facade

6 Tips for New College and University Educators

Table of Contents

A List of Tips for New Educators

You Should Always Get Details Down in Writing

You Can Prevent Most—But Not All—Cheating

You Should Take Your Teaching Evaluations With a Grain of Salt

You Should Find Ways to Align Expectations

You Do Not Have to Stress About Your Choices

You Can Chalk a Course or Semester Up as a Loss

Bonus Tip: You Can Grow As an Educator

Checking Up on Google Search in 2024: It’s Worse Somehow

Table of Contents

Following Up On Old Gripes With Google Search

Google Ranks Duplicate Content Separately

Anti-intellectual Content Is Multiplying

Google Does Not Rank the Canonical Article

Imitation Is the Sincerest Form of Flattery

Reviewing “How to Check If a List is Empty in Python” Again

Rank 1: Stack Overflow

Rank 2: freeCodeCamp

Rank 3: GeeksForGeeks

Rank 4+: Everyone Else

Reviewing “Python Code Snippets” Again

Rank 1: BuiltIn

Rank 2: Squash Labs

Rank 3: GeeksForGeeks

Rank 4+: Everyone Else

Contextualizing the Problem

Key Takeaways

Google Threatens to Ruin Search as We Know It

Table of Contents

Google’s Latest Search Feature

A Violation of the Social Contract

What Is the Long Term Plan?

Constructing Objects With init()

Converting Objects to Strings With str() and repr()

Comparing Two Objects With eq()

Creating an Iterable Using `iter()`

Creating an Iterable Using `getitem()`