this post was submitted on 12 Apr 2025
196 points (90.2% liked)

Technology

76361 readers
1257 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] thebestaquaman@lemmy.world 111 points 6 months ago (2 children)

I write a lot of Python. I hate it when people use "X is more pythonic" as some kind of argument for what is a better solution to a problem. I also have a hang up with people acting like python has any form of type safety, instead of just embracing duck typing.This lands us at the following:

The article states that "you can check a list for emptiness in two ways: if not mylist or if len(mylist) == 0". Already here, a fundamental mistake has been made: You don't know (and shouldn't care) whether mylist is a list. These two checks are not different ways of doing the same thing, but two different checks altogether. The first checks whether the object is "falsey" and the second checks whether the object has a well defined length that is zero. These are two completely different checks, which often (but far from always) overlap. Embrace the duck type- type safe python is a myth.

[–] iAvicenna@lemmy.world 11 points 6 months ago* (last edited 6 months ago) (2 children)

isn't the expected behaviour exactly identical on any object that has len defined:

"By default, an object is considered true unless its class defines either a bool() method that returns False or a len() method that returns zero, when called with the object."

ps: well your objection is I guess that we cant know in advance if that said object has len defined such as being a collection so this question does not really apply to your post I guess.

[–] CompassRed@discuss.tchncs.de 17 points 6 months ago

It's not the same, and you kinda answered your own question with that quote. Consider what happens when an object defines both dunder bool and dunder len. It's possible for dunder len to return 0 while dunder bool returns True, in which case the falsy-ness of the instance would not depend at all on the value of len

[–] thebestaquaman@lemmy.world 5 points 6 months ago

Exactly as you said yourself: Checking falsieness does not guarantee that the object has a length. There is considerable overlap between the two, and if it turns out that this check is a performance bottleneck (which I have a hard time imagining) it can be appropriate to check for falsieness instead of zero length. But in that case, don't be surprised if you suddenly get an obscure bug because of some custom object not behaving the way you assumed it would.

I guess my primary point is that we should be checking for what we actually care about, because that makes intent clear and reduces the chance for obscure bugs.

[–] sugar_in_your_tea@sh.itjust.works 6 points 6 months ago (6 children)

type safe python is a myth

Sure, but type hints provide a ton of value in documenting for your users what the code expects. I use type hints everywhere, and it's fantastic! Yes, there's no guarantee that the types are correct, but with static analysis and the assumption that your users want their code to work correctly, there's a very high chance that the types are correct.

That said, I lie about types all the time. For example, if my function accepts a class instance as an argument, the intention is that the code accept any class that implements the same methods as the one I've defined in the parameter list, and you don't necessarily have to pass an instance of that class in (or one of its sub-classes). But I feel like putting something reasonable in there makes a lot more sense than nothing, and I can clarify in the docstring that I really just need something that looks like that object. One of these days I'll get around to switching that to Protocol classes to reduce type errors.

That said, I don't type hint everything. A lot of private methods and private functions don't have types, because they're usually short and aren't used outside the class/file anyway, so what's the point?

load more comments (6 replies)
[–] PattyMcB@lemmy.world 53 points 6 months ago (9 children)

I know I'm gonna get downvoted to oblivion for this, but... Serious question: why use Python if you're concerned about performance?

[–] lengau@midwest.social 54 points 6 months ago (2 children)

It's all about trade-offs. Here are a few reasons why one might care about performance in their Python code:

  1. Performance is often more tied to the code than to the interpreter - an O(n³) algorithm in blazing fast C won't necessarily perform any better than an O(nlogn) algorithm in Python.
  2. Just because this particular Python code isn't particularly performance constrained doesn't mean you're okay with it taking twice as long.
  3. Rewriting a large code base can be very expensive and error-prone. Converting small, very performance-sensitive parts of the code to a compiled language while keeping the bulk of the business logic in Python is often a much better value proposition.

These are also performance benefits one can get essentially for free with linter rules.

Anecdotally: in my final year of university I took a computational physics class. Many of my classmates wrote their simulations in C or C++. I would rotate between Matlab, Octave and Python. During one of our labs where we wrote particle simulations, I wrote and ran Octave and Python simulations in the time it took my classmates to write their C/C++ versions, and the two fastest simulations in the class were my Octave and Python ones, respectively. (The professor's own sim came in third place). The overhead my classmates had dealing with poorly optimised code that caused constant cache misses was far greater than the interpreter overhead in my code (though at the time I don't think I could have explained why their code was so slow compared to mine).

[–] PattyMcB@lemmy.world 5 points 6 months ago (4 children)

I appreciate the large amount of info. Great answer. It just doesn't make sense to me, all things being equal (including performant algorithms), why choose Python and then make a small performance tweak like in the article? I understand preferring the faster implementation, but it seems to me like waxing your car to reduce wind resistance to make it go faster, when installing a turbo-charger would be much more effective.

[–] Teanut@lemmy.world 16 points 6 months ago

If you use the profiler and see that the slower operation is being used frequently, and is taking up a chunk of time deemed significant, why not swap it to the faster version?

In a simulation I'm working on that goes through 42 million rounds I spent some time profiling and going through the code that was eating up a lot of time (especially things executed all 42 million times) and trying to find some optimizations. Brought the run time down from about 10 minutes to 5 minutes.

I certainly wasn't going to start over in C++ or Rust, and if I'd started with either of those languages I would have missed out on a lot of really strong Python libraries and probably spent more time coding rather than refining the simulation.

[–] lengau@midwest.social 9 points 6 months ago (1 children)

I think a better analogy would be that you're tuning your bike for better performance because the trade-offs of switching to a car are worse than keeping the bike.

[–] PattyMcB@lemmy.world 5 points 6 months ago (1 children)
[–] Azzu@lemm.ee 5 points 6 months ago

Good, more people should buy bicycles

load more comments (2 replies)
load more comments (1 replies)
[–] JustAnotherKay@lemmy.world 14 points 6 months ago* (last edited 6 months ago) (4 children)

Honestly most people use Python because it has fantastic libraries. They optimize it because the language is middling, but the libraries are gorgeous

ETA: This might double post because my Internet sucks right now, will fix when I have a chance

load more comments (4 replies)
[–] pastermil@sh.itjust.works 11 points 6 months ago* (last edited 6 months ago) (2 children)

This is my two cents as someone in the industry.

Because, while you don't want to nitpick on each instruction cycle, sometimes the code runs millions of times and each microsecond adds up.

Keep in mind that people use this kind of things for work, serving real world customers who are doing their work.

Yes, the language itself is not optimal even by design, but its easy to work with, so they are making it worth a while. There's no shortage of people who can work with it. It is easy to develop and maintain stuff with it, cutting development cost. Yes, we're talking real businesses with real resource constraints.

load more comments (2 replies)
[–] Takapapatapaka@lemmy.world 8 points 6 months ago

You may want to beneficiate from little performance boost even though you mostly don't need it and still need python's advantages. Being interested in performance isnt always looking for the very best performance there is out of any language, it can also be using little tips to go a tiny bit faster when you can.

[–] sugar_in_your_tea@sh.itjust.works 6 points 6 months ago* (last edited 6 months ago)

Yes, Python is the wrong choice if performance is your top priority.

But here's another perspective: why leave easy performance wins on the table? Especially if the cost is simpler code that works as you probably wanted anyway with both None and []?

Python is great if you want a really fast development cycle, because the code is generally quite simple and it's "fast enough." Any wins for "fast enough" is appreciated, because it delays me needing to actually look into little performance issues. It's pretty easy for me to write a simple regex to fix this cose (s/if len\((\w+)\) == 0:/if not \1:/), and my codebase will be slightly faster. That's awesome! I could even write up a quick pylint or ruff rule to catch these cases for developers going forward (if there isn't one already).

If I'm actively tweaking things in my Python code to get a little better performance, you're right, I should probably just use something else (writing a native module is probably a better use of time). But the author isn't arguing that you should do that, just that, in this case, if not foo is preferred over if len(foo) == 0 for technical reasons, and I'll add that it makes a ton of sense for readability reasons as well.

Here are some other simple wins:

  • [] and {} instead of list() and dict() - the former copy constants, whereas the latter actually constructs things; oh, and you save a few chars
  • use list comprehensions instead of regular loops - list comprehensions seem to be faster due to not needing to call append (and less code)
  • use built-ins when you can - they're often implemented in native code

I consider each of those cleaner Python code anyway, because they're less code, just as explicit, and use built-in language features instead of reinventing the wheel.

[–] Randelung@lemmy.world 4 points 6 months ago (1 children)

It comes down to the question "Is YOUR C++ code faster than Python?" (and of course the reverse).

I've built a SCADA from scratch and performance requirements are low to begin with, seeing as it's all network bound and real world objects take time to react, but I'm finding everything is very timely.

A colleague used SQLAlchemy for a similar task and got abysmal performance. No wonder, it's constantly querying the DB for single results.

Exactly!

We rewrote some Fortran code (known for fast perf) into Python and the net result was faster. Why? They used bubble sort in a hot loop, whereas we used Python's built-in sort (probably qsort or similar). So despite Python being "slower" on average, good architecture matters a lot more.

And your Python code doesn't have to be 100% Python, you can write performance-critical code in something else, like C++ or Rust. This is very common, and it's why popular Python libraries like numpy and scipy are written in a more performant language with a Python wrapper.

load more comments (3 replies)
[–] sirber@lemmy.ca 42 points 6 months ago* (last edited 6 months ago) (4 children)

How does Python know if it's my list or not?

[–] 2xsaiko@discuss.tchncs.de 27 points 6 months ago
[–] JasonDJ@lemmy.zip 7 points 6 months ago (2 children)

if isinstance(mylist, list) and not mylist

Problem solved.

Or if not mylist # check if list is empty

[–] sirber@lemmy.ca 15 points 6 months ago (1 children)

I think you missed the joke 😅

[–] PattyMcB@lemmy.world 3 points 6 months ago

I thought it was funny!

[–] gravitas_deficiency@sh.itjust.works 4 points 6 months ago (7 children)

You’re checking if mylist is falsey. Sometimes that’s the same as checking if it’s empty, if it’s actually a list, but that’s not guaranteed.

load more comments (7 replies)
load more comments (2 replies)
[–] iAvicenna@lemmy.world 28 points 6 months ago* (last edited 6 months ago) (20 children)

Yea and then you use "not" with a variable name that does not make it obvious that it is a list and another person who reads the code thinks it is a bool. Hell a couple of months later you yourself wont even understand that it is a list. Moreover "not" will not throw an error if you don't use an sequence/collection there as you should but len will.

You should not sacrifice code readability and safety for over optimization, this is phyton after all I don't think list lengths will be your bottle neck.

[–] jerkface@lemmy.ca 15 points 6 months ago (7 children)

Strongly disagree that not x implies to programmers that x is a bool.

[–] taladar@sh.itjust.works 7 points 6 months ago

It does if you are used to sane languages instead of the implicit conversion nonsense C and the "dynamic" languages are doing

[–] iAvicenna@lemmy.world 6 points 6 months ago (5 children)

well it does not imply directly per se since you can "not" many things but I feel like my first assumption would be it is used in a bool context

[–] thebestaquaman@lemmy.world 8 points 6 months ago (2 children)

I would say it depends heavily on the language. In Python, it's very common that different objects have some kind of Boolean interpretation, so assuming that an object is a bool because it is used in a Boolean context is a bit silly.

[–] iAvicenna@lemmy.world 4 points 6 months ago* (last edited 6 months ago) (4 children)

Well fair enough but I still like the fact that len makes the aim and the object more transparent on a quick look through the code which is what I am trying to get at. The supporting argument on bools wasn't't very to the point I agree.

That being said is there an application of "not" on other classes which cannot be replaced by some other more transparent operator (I confess I only know the bool and length context)? I would rather have transparently named operators rather than having to remember what "not" does on ten different types. I like duck typing as much as the next person, but when it is so opaque (name-wise) as in the case of "not", I prefer alternatives.

For instance having open or read on different objects which does really read or open some data vs not some object god knows what it does I should memorise each case.

[–] jerkface@lemmy.ca 5 points 6 months ago* (last edited 6 months ago) (4 children)

Truthiness is so fundamental, in most languages, all values have a truthiness, whether they are bool or not. Even in C, int x = value(); if (!x) x_is_not_zero(); is valid and idiomatic.

I appreciate the point that calling a method gives more context cues and potentially aids readability, but in this case I feel like not is the python idiom people expect and reads just fine.

load more comments (4 replies)
load more comments (3 replies)
load more comments (1 replies)
load more comments (4 replies)
load more comments (5 replies)
load more comments (19 replies)
[–] Opisek@lemmy.world 28 points 6 months ago (5 children)

The graph makes no sense. Did a generative AI make it.

I think there's a good chance of that:

  • -2x instead of ~2x - a human is unlikely to make that mistake
  • no space here: ==0 - there's a space every other time it's done, including the screenshot
  • the numbers are wrong - the screenshot has different data than the image
  • why are there three bars? A naive approach would have two.
[–] gerryflap@feddit.nl 5 points 6 months ago

Looks like it. It's a complete fever dream graph. I really don't get how someone can use an image like that. Personally I don't really like AI art anyways, but I could somewhat understand it as a sort of "filler" image to make your article a bit more interesting. But a graph that is supposed to convey actual information? No idea why anyone would AI gen that without checking

[–] pyre@lemmy.world 4 points 6 months ago

yeah I got angry just looking at it

load more comments (2 replies)
[–] uis@lemm.ee 16 points 6 months ago (1 children)

There are decades of articles on c++ optimizations, that say "use empty() instead of size()", which is same as here.

[–] dreugeworst@lemmy.ml 5 points 6 months ago

except for c++ it was just to avoid a single function call, not extra indirection. also on modern compilers size() will get inlined and ultimate instructions generated by the compiler will likely be the same

[–] ne0n@lemmy.world 7 points 6 months ago (5 children)

Isn’t “-2x faster” 2x slower?

[–] ChaoticNeutralCzech@feddit.org 5 points 6 months ago

That woulb be 0.5x. −2x implies negative duration, which makes no sense. Neither does the layout of anything else in the image.

load more comments (4 replies)
[–] antlion@lemmy.dbzer0.com 7 points 6 months ago (2 children)

Could also compare against:

if not len(mylist)

That way this version isn’t evaluating two functions. The bool evaluation of an integer is false when zero, otherwise true.

[–] FooBarrington@lemmy.world 4 points 6 months ago (2 children)

This is honestly the worst version regarding readability. Don't rely on implicit coercion, people.

load more comments (2 replies)
load more comments (1 replies)
[–] knighthawk0811@lemmy.ml 7 points 6 months ago (1 children)

so these are the only 2 ways then? huge if true

load more comments (1 replies)
[–] gigachad@sh.itjust.works 6 points 6 months ago (1 children)

I don't like it very much, my variable could also be None here

[–] iknowitwheniseeit@lemmynsfw.com 3 points 6 months ago (1 children)

You'd need to explicitly check for None if using the len() construct as well, so this doesn't change the point of the article.

[–] gigachad@sh.itjust.works 6 points 6 months ago* (last edited 6 months ago) (10 children)

But None has no len

if not foo:  

-> foo could be an empty list or None, it is ambiguous.

len(foo) will lead to an exception TypeError if foo is None, I can cleanly catch that.

It suggests I deal with a boolean when that is not the case. Explicit is better than implicit, and if not foo to check for an empty list may be pythonic, but it's still implicit af

load more comments (10 replies)
[–] palordrolap@fedia.io 5 points 6 months ago (6 children)

As a Perl fossil I recognise this syntax as equivalent to if(not @myarray) which does the same thing. And here I was thinking Guido had deliberately aimed to avoid Perlisms in Python.

That said, the Perlism in question is the right* way to do it in Perl. The length operator does not do the expected thing on an array variable. (You get the length of the stringified length of the array. And a warning if those are enabled.)

* You can start a fight with modern Perl hackers with whether unless(@myarray) is better or just plain wrong, even if it works and is equivalent.

load more comments (6 replies)
load more comments
view more: next ›