define generic `in` in terms of `any` #55669

matthias314 · 2024-09-02T13:23:33Z

I think that the current implementation of x in itr is identical to any(==(x), itr), see below. An added bonus of calling any is that a more efficient implementation of any for a custom type will most likely give an efficient implementation of in for that type at the same time.

julia/base/reduce.jl

Lines 1220 to 1238 in 3a2a4d8

    
           any(f, itr) = _any(f, itr, :) 
        
           for ItrT = (Tuple,Any) 
        
               # define a generic method and a specialized version for `Tuple`, 
        
               # whose method bodies are identical, while giving better effects to the later 
        
               @eval function _any(f, itr::$ItrT, ::Colon) 
        
                   $(ItrT === Tuple ? :(@_terminates_locally_meta) : :nothing) 
        
                   anymissing = false 
        
                   for x in itr 
        
                       v = f(x) 
        
                       if ismissing(v) 
        
                           anymissing = true 
        
                       else 
        
                           v && return true 
        
                       end 
        
                   end 
        
                   return anymissing ? missing : false 
        
               end 
        
           end

adienes · 2024-09-02T15:15:42Z

big thumbs up to remove special-cased missing logic !

aviatesk

LGTM.

mikmoore · 2024-09-04T16:32:33Z

This doesn't pertain to the current PR, which merely matches the existing behavior of using ==.

But I will gripe that in's ambiguity on the subject of == vs isequal is awful:

  in(item, collection) -> Bool
  ∈(item, collection) -> Bool

  Determine whether an item is in the given collection, in the sense that it is == to one of the values generated by
  iterating over the collection. Return a Bool value, except if item is missing or collection contains missing but not
  item, in which case missing is returned (three-valued logic (https://en.wikipedia.org/wiki/Three-valued_logic),
  matching the behavior of any and ==).

  Some collections follow a slightly different definition. For example, Sets check whether the item isequal to one of
  the elements; Dicts look for key=>value pairs, and the key is compared using isequal.

I find the choice of == to be questionable with regard to AbstractFloat behavior

julia> -0.0 in [+0.0] # debatable, but if I wanted `==` or `a <= b <= c` I probably should have written that
true

julia> NaN in [NaN] # these are `===`
false

and think the fact that Set (and others?) has different behavior is abhorrent

julia> -0.0 in Set([+0.0])
false

julia> NaN in Set([NaN])
true

aviatesk · 2024-09-06T08:08:43Z

Since that behavior is clearly stated in the docstring, I assume it's the result of a lengthy discussion. Rather than debating it in this PR, it seems more appropriate to discuss it in a separate issue.

aviatesk · 2024-09-06T08:09:12Z

@nanosoldier runbenchmarks(!"scalar", vs=":master")

nanosoldier · 2024-09-06T14:40:07Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

matthias314 · 2024-09-13T12:13:13Z

It's hard for me to interpret the benchmark results. Do they show any problem?

defined generic in in terms of any

ea6f2e3

aviatesk approved these changes Sep 3, 2024

View reviewed changes

Merge branch 'master' into m3/in-any

53cea02

nsajko added the collections Data structures holding multiple items, e.g. sets label Sep 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

define generic `in` in terms of `any` #55669

define generic `in` in terms of `any` #55669

matthias314 commented Sep 2, 2024

adienes commented Sep 2, 2024

aviatesk left a comment

mikmoore commented Sep 4, 2024

aviatesk commented Sep 6, 2024

aviatesk commented Sep 6, 2024

nanosoldier commented Sep 6, 2024

matthias314 commented Sep 13, 2024

	any(f, itr) = _any(f, itr, :)

	for ItrT = (Tuple,Any)
	# define a generic method and a specialized version for `Tuple`,
	# whose method bodies are identical, while giving better effects to the later
	@eval function _any(f, itr::$ItrT, ::Colon)
	$(ItrT === Tuple ? :(@_terminates_locally_meta) : :nothing)
	anymissing = false
	for x in itr
	v = f(x)
	if ismissing(v)
	anymissing = true
	else
	v && return true
	end
	end
	return anymissing ? missing : false
	end
	end

define generic in in terms of any #55669

Are you sure you want to change the base?

define generic in in terms of any #55669

Conversation

matthias314 commented Sep 2, 2024

adienes commented Sep 2, 2024

aviatesk left a comment

Choose a reason for hiding this comment

mikmoore commented Sep 4, 2024

aviatesk commented Sep 6, 2024

aviatesk commented Sep 6, 2024

nanosoldier commented Sep 6, 2024

matthias314 commented Sep 13, 2024

define generic `in` in terms of `any` #55669

define generic `in` in terms of `any` #55669