Projects user NOT assigned to without getting duplicate records

ZosoKat · April 9, 2014, 7:43pm

I have a table containing project assignments represented by a projectid column and a user_assignment column which contains the user’s unique userid. A user is assigned to a project if they have a record in the user assignment table associated with the project id. I want retrieve all the projects that a specific user is NOT assigned to without getting duplicate records since there are many users assigned to projects.

While these examples will retrieve only one record per project (i.e., no duplicates), it returns projects that user ‘abc123’ is and is NOT assigned to. I need to retrieve the projects that they are NOT assigned to.

SELECT DISTINCT `propid` FROM `user_assignments` WHERE `userid` &lt;&gt; 'abc123' ORDER BY `propid` ASC

SELECT DISTINCT `propid` FROM `user_assignments` WHERE (`userid` &lt;&gt; 'abc123') ORDER BY `propid` ASC

I am sure there is a very simple solution but I am not seeing it.

The user assignment table:

r937 · April 10, 2014, 2:42am

this is not going to be efficient because it has to examine all rows of the table

SELECT propid
  FROM user_assignments
GROUP
    BY propid
HAVING COUNT(CASE WHEN userid = 'abc123'
                  THEN 'nope'
                  ELSE NULL END) = 0

ZosoKat · April 11, 2014, 7:06pm

r937:

this is not going to be efficient because it has to examine all rows of the table
SELECT propid 
  FROM user_assignments 
GROUP
    BY propid
HAVING COUNT(CASE WHEN userid = 'abc123' 
                  THEN 'nope'
                  ELSE NULL END) = 0

Yes I found that out. It takes a long time to execute that query.

r937 · April 11, 2014, 7:25pm

it’ll go a lot faster if there’s a composite index on propid,userid

those two should actually be the primary key, instead of an auto_increment, but since you’re using an auto_increment, then declare them as a UNIQUE index

KyleWolfe · April 11, 2014, 7:25pm

I haven’t played in the world of SQL for awhile, but might a ‘not in’ work better here assuming working on indexed fields and a relatively reasonable workload for a single user.


select 
     distinct propid 
from 
     user_assignments 
where 
     propid not in (
           select distinct propid from user_assignments where userid = 'foo'
      )

r937 · April 11, 2014, 7:26pm

sadly, it might not

r937 · April 11, 2014, 8:03pm

on the other hand, it might work as good as

unlikely it would be gooder