Is there a data structure that only stores hash codes and not the actual objects?

7















My use-case is that I'm looking for a data structure in Java that will let me see if an object with the same hash code is inside (by calling contains()), but I will never need to iterate through the elements or retrieve the actual objects. A HashSet is close, but from my understanding, it still contains references to the actual objects, and that would be a waste of memory since I won't ever need the contents of the actual objects. The best option I can think of is a HashSet of type Integer storing only the hash codes, but I'm wondering if there is a built-in data structure that would accomplish the same thing (and only accept one type as opposed to HashSet of type Integer which will accept the hash code of any object).










  • 4
    Is your hash function perfect? Or can you have multiple objects with the same hash value?
    – arshajii
    2 hours ago

  • 5
    What about hash collisions?
    – Nathan Hughes台湾不在中国
    2 hours ago

  • 7
    The HashSet will contain a reference to your object, not a copy, so don't worry about space. A HashSet<Integer> would probably use up more space because it has references to Integer objects.
    – Sweeper
    2 hours ago

  • I agree with @Sweeper, unless you have a real need for super-duper optimization. Also, your second idea of storing hash codes as integers wouldn't be more efficient, as it would store the hash plus the hash of the hash.
    – Joel
    2 hours ago

  • @Sweeper HashSet uses a HashMap internally. The memory space is the same.
    – Octavian R.
    2 hours ago
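The collision concern raised in the comments can be made concrete. Below is a minimal sketch, assuming String keys and the plain HashSet<Integer> approach from the question; the class name CollisionDemo and the method reportsPresent are hypothetical and exist only for this illustration. The strings "Aa" and "BB" have the same hashCode(), so a set that stores only hash codes reports "BB" as present after only "Aa" was added.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical demo class: shows why storing only hash codes yields false positives.
final class CollisionDemo {

    // Returns true if the hash code of 'probe' is found after adding only 'added'.
    static boolean reportsPresent(String added, String probe) {
        Set<Integer> hashCodes = new HashSet<>();
        hashCodes.add(added.hashCode());
        return hashCodes.contains(probe.hashCode());
    }

    public static void main(String[] args) {
        // "Aa" and "BB" are different strings with the same hash code (2112).
        System.out.println("Aa".hashCode() == "BB".hashCode()); // true
        System.out.println(reportsPresent("Aa", "BB"));         // true: a false positive
        System.out.println(reportsPresent("Aa", "Ab"));         // false
    }
}
```

Whether such false positives are acceptable depends on the use case; if they are not, the objects themselves (or something that identifies them exactly) have to be stored.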
















java hashset






edited 2 hours ago
asked 2 hours ago by B Yellow







5 Answers
6














A Bloom filter can tell whether an object might be a member, or is definitely not a member. You can control the likelihood of false positives. It stores only bits, never the objects: each added element sets a small number of bit positions derived from its hash.

The Guava library provides an implementation in Java (com.google.common.hash.BloomFilter).
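To make the idea concrete, here is a minimal Bloom filter sketch built on java.util.BitSet. It is not Guava's implementation: the class name TinyBloomFilter, the mixing constants, and the double-hashing scheme are assumptions chosen for illustration. Because it derives every bit position from hashCode(), two objects with equal hash codes are always indistinguishable to it.

```java
import java.util.BitSet;

// Minimal Bloom filter sketch (illustrative only, not Guava's implementation).
final class TinyBloomFilter<T> {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;

    TinyBloomFilter(int numBits, int numHashes) {
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    // Murmur3-style finalizer: spreads the bits of an int.
    private static int mix(int h) {
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    // i-th bit position for a hash code, via double hashing: h1 + i * h2.
    private int index(int hash, int i) {
        int h1 = mix(hash);
        int h2 = mix(h1 ^ 0x9e3779b9) | 1; // force an odd step
        return Math.floorMod(h1 + i * h2, numBits);
    }

    void add(T element) {
        int hash = element.hashCode();
        for (int i = 0; i < numHashes; i++) {
            bits.set(index(hash, i));
        }
    }

    // false means "definitely not added"; true means "possibly added".
    boolean mightContain(T element) {
        int hash = element.hashCode();
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(index(hash, i))) {
                return false;
            }
        }
        return true;
    }
}
```

A filter created as new TinyBloomFilter<String>(1 << 16, 4) uses 8 KiB of bits regardless of how large the added objects are. Elements cannot be removed, since clearing a bit could also erase evidence of other elements.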






answered 2 hours ago by Andy Thomas (edited 2 hours ago)

  • Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!
    – Steve
    2 hours ago

  • 1
    False positives, but you can control their probability. Another disadvantage is that you can't remove elements.
    – Andy Thomas
    2 hours ago

  • The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2^31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".
    – Leo Aso
    2 hours ago
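The size objection in the last comment can be checked with a little arithmetic. What follows is a rough worked example using the standard Bloom filter sizing formulas; the element count n = 10^6 and the target false-positive rate p = 0.01 are assumed values, chosen only for illustration.

```latex
% Direct bitmap with one bit per non-negative int hash code:
2^{31}\ \text{bits} = 2^{28}\ \text{bytes} = 256\ \text{MiB}

% Bloom filter sized for n elements and false-positive rate p
% (m = number of bits, k = number of hash functions):
m = -\frac{n \ln p}{(\ln 2)^2}, \qquad k = \frac{m}{n}\,\ln 2

% Assumed example: n = 10^6, p = 0.01
m \approx 9.59 \times 10^{6}\ \text{bits} \approx 1.2\ \text{MB},
\qquad k \approx 6.6 \ (\text{rounded up to } 7)
```

Under these assumptions the filter's size depends on the number of elements and the tolerated error rate, not on the range of hashCode(). Objects with equal hash codes still cannot be told apart if hashCode() is the only input to the filter.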



















0














There is no data structure like that, but you can create a wrapper around HashSet to suit your needs. Something like this:



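    import java.util.HashSet;

    // Wrapper that stores only int hash codes, never the objects themselves.
    public class MyHashSet {

        private final HashSet<Integer> set;

        public MyHashSet() {
            set = new HashSet<>();
        }

        public void add(int hash) {
            set.add(hash);
        }

        public boolean contains(int hash) {
            return set.contains(hash);
        }
    }
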
answered 2 hours ago by Pritam Banerjee

  • 1
    I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.
    – Steve
    2 hours ago

  • @Steve His main concern was holding references to the objects. Here it stores only references to Integer objects.
    – Pritam Banerjee
    2 hours ago


















0














There is no such built-in data structure, because such a data structure is rarely needed. It's easy to build one, though.



    import java.util.HashSet;

    // Stores only the hash codes of added elements; T restricts what may be added.
    public class HashCodeSet<T> {

        private final HashSet<Integer> hashCodes;

        public HashCodeSet() {
            hashCodes = new HashSet<>();
        }

        public HashCodeSet(int initialCapacity) {
            hashCodes = new HashSet<>(initialCapacity);
        }

        public HashCodeSet(HashCodeSet<T> toCopy) {
            hashCodes = new HashSet<>(toCopy.hashCodes);
        }

        public void add(T element) {
            hashCodes.add(element.hashCode());
        }

        public boolean containsHashCodeOf(T element) {
            return hashCodes.contains(element.hashCode());
        }

        @Override
        public boolean equals(Object o) {
            return o instanceof HashCodeSet
                    && hashCodes.equals(((HashCodeSet<?>) o).hashCodes);
        }

        @Override
        public int hashCode() {
            return hashCodes.hashCode(); // hash-ception
        }

        @Override
        public String toString() {
            return hashCodes.toString();
        }
    }

answered 2 hours ago by Leo Aso

  • 1
    I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.
    – Steve
    2 hours ago

  • I realize that I was completely unfair on this. I've removed my objections. Feel free to delete your comments on my comments. I still think there's something more to the problem, but I was off base. Sorry.
    – Steve
    1 hour ago


















0














You could use a primitive collection implementation such as an IntSet (available in third-party primitive-collection libraries, not in the JDK) to store the hash code values. As others have mentioned, this assumes collisions aren't a problem.
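For readers without such a library at hand, the following is a minimal sketch of what a primitive int set does internally. It is an assumption-laden illustration, not the API of any particular IntSet: the class name IntHashSet, the linear-probing scheme, and the 50% load limit are all choices made here for brevity. The point is that values sit directly in an int[] instead of being boxed into Integer objects.

```java
// Minimal open-addressing set of ints (illustrative sketch, not a library class).
final class IntHashSet {
    private int[] table = new int[16];         // slots; length is always a power of two
    private boolean[] used = new boolean[16];  // marks which slots are occupied
    private int size;

    // Scrambles the value so that clustered inputs spread across the table.
    private static int spread(int h) {
        h *= 0x9e3779b9;
        return h ^ (h >>> 16);
    }

    // Returns the slot holding 'value', or the empty slot where it would go.
    private int findSlot(int value) {
        int mask = table.length - 1;
        int i = spread(value) & mask;
        while (used[i] && table[i] != value) {
            i = (i + 1) & mask; // linear probing
        }
        return i;
    }

    // Adds the value; returns false if it was already present.
    boolean add(int value) {
        int i = findSlot(value);
        if (used[i]) {
            return false;
        }
        used[i] = true;
        table[i] = value;
        if (++size * 2 > table.length) {
            grow(); // keep the table at most half full
        }
        return true;
    }

    boolean contains(int value) {
        return used[findSlot(value)];
    }

    int size() {
        return size;
    }

    // Doubles the capacity and reinserts every stored value.
    private void grow() {
        int[] oldTable = table;
        boolean[] oldUsed = used;
        table = new int[oldTable.length * 2];
        used = new boolean[oldTable.length * 2];
        for (int j = 0; j < oldTable.length; j++) {
            if (oldUsed[j]) {
                int i = findSlot(oldTable[j]);
                used[i] = true;
                table[i] = oldTable[j];
            }
        }
    }
}
```

Usage would look like set.add(obj.hashCode()) followed later by set.contains(other.hashCode()). Each slot costs an int plus a boolean, which is typically far less than a HashSet<Integer> entry with its node and boxed Integer, though exact figures depend on the JVM.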






    0














If you want to track whether a hash code is already present, and to do so in a memory-efficient way, a BitSet may suit your requirements.

Look at the following example:

    import java.util.BitSet;

    public class BitSetExample {
        public static void main(String[] args) {
            BitSet hashCodes = new BitSet();
            // BitSet indices must be non-negative; "1".hashCode() is 49
            hashCodes.set("1".hashCode());

            System.out.println(hashCodes.get("1".hashCode())); // true
            System.out.println(hashCodes.get("2".hashCode())); // false
        }
    }

The BitSet "implements a vector of bits that grows as needed". It's a JDK "built-in data structure" that doesn't contain "references to the actual objects". It stores only whether "the same hash code is inside".

EDIT:

As @Steve mentioned in his comment, the JDK BitSet implementation isn't the most memory-efficient one. There are more memory-efficient bit set implementations, though they are not built in.
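Two caveats of the plain BitSet approach are worth spelling out. BitSet.set throws an IndexOutOfBoundsException for a negative index, and hashCode() can be negative; also, the backing array grows to cover the highest bit set, so a single hash code near Integer.MAX_VALUE costs roughly 256 MiB. The sketch below handles the first caveat only, by keeping one BitSet per sign. The class name HashCodeBitSet is hypothetical, and the memory concern remains.

```java
import java.util.BitSet;

// Illustrative sketch: tracks arbitrary int hash codes, including negative ones.
final class HashCodeBitSet {
    private final BitSet nonNegative = new BitSet(); // index = hash, for hash >= 0
    private final BitSet negative = new BitSet();    // index = ~hash, for hash < 0

    void add(Object o) {
        int h = o.hashCode();
        if (h >= 0) {
            nonNegative.set(h);
        } else {
            // ~h maps -1 .. Integer.MIN_VALUE onto 0 .. Integer.MAX_VALUE
            negative.set(~h);
        }
    }

    boolean containsHashCodeOf(Object o) {
        int h = o.hashCode();
        return h >= 0 ? nonNegative.get(h) : negative.get(~h);
    }
}
```

This works well when hash codes are small in magnitude (for example, short strings or small integers). For widely scattered hash codes, a sparse bit set or one of the other approaches in this thread is likely a better fit.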






  • I don't know how a BitSet stores individual bits. Since this usage will obviously spread bits across a very large input domain, are you sure those are stored efficiently? Just asking. The naive assumption is that the structure is an array of bytes that is expanded to include any bit position and all positions before it, which would be monstrously inefficient. But I don't know how it actually represents bits that are spread far apart.
    – Steve
    2 hours ago

  • It appears that your solution won't work. See github.com/brettwooldridge/SparseBitSet
    – Steve
    2 hours ago

  • @Steve You're right. I additionally found this post. But the idea of a bit set is basically not bad. The problem is rather the implementation of the JDK BitSet.
    – LuCio
    1 hour ago











    5 Answers
    5






    active

    oldest

    votes








    5 Answers
    5






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    6














    A Bloom filter can tell whether an object might be a member, or is definitely not a member. You can control the likelihood of false positives. A single bit is stored per hash value.



    The Guava library provides an implementation in Java.






    share|improve this answer

























    • Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

      – Steve
      2 hours ago







    • 1





      False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

      – Andy Thomas
      2 hours ago











    • The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

      – Leo Aso
      2 hours ago
















    6














    A Bloom filter can tell whether an object might be a member, or is definitely not a member. You can control the likelihood of false positives. A single bit is stored per hash value.



    The Guava library provides an implementation in Java.






    share|improve this answer

























    • Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

      – Steve
      2 hours ago







    • 1





      False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

      – Andy Thomas
      2 hours ago











    • The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

      – Leo Aso
      2 hours ago














    6












    6








    6







    A Bloom filter can tell whether an object might be a member, or is definitely not a member. You can control the likelihood of false positives. A single bit is stored per hash value.



    The Guava library provides an implementation in Java.






    share|improve this answer















    A Bloom filter can tell whether an object might be a member, or is definitely not a member. You can control the likelihood of false positives. A single bit is stored per hash value.



    The Guava library provides an implementation in Java.







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited 2 hours ago

























    answered 2 hours ago









    Andy ThomasAndy Thomas

    68k980133




    68k980133












    • Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

      – Steve
      2 hours ago







    • 1





      False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

      – Andy Thomas
      2 hours ago











    • The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

      – Leo Aso
      2 hours ago


















    • Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

      – Steve
      2 hours ago







    • 1





      False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

      – Andy Thomas
      2 hours ago











    • The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

      – Leo Aso
      2 hours ago

















    Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

    – Steve
    2 hours ago






    Nice. This seems like the solution for very low storage overhead. But you have to worry about the false negative case. If you can statistically eliminate that, this is great!

    – Steve
    2 hours ago





    1




    1





    False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

    – Andy Thomas
    2 hours ago





    False positives, but you can control their probability. Another disadvantage is that you can't remove elements.

    – Andy Thomas
    2 hours ago













    The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

    – Leo Aso
    2 hours ago






    The question was for a data structure that checks using only the predefined hashCode(), which can potentially have 2^31 values (counting only positives). A Bloom filter that uses one hash function with 2 ^ 31 possible values would be extraordinarily large, seeing as it is basically just a BitSet. I don't see how that counts as "very low storage overhead".

    – Leo Aso
    2 hours ago














    0














    There is no data structure like that, but you can create a wrapper around HashSet to suit your needs. Something like this:



    public class MyHashSet

    private HashSet<Integer> set;

    public MyHashSet()
    set = new HashSet<>();


    public void add(int hash)
    set.add(hash);


    public boolean contains(int hash)
    return set.contains(hash);







    share|improve this answer


















    • 1





      I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

      – Steve
      2 hours ago











    • @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

      – Pritam Banerjee
      2 hours ago















    0














    There is no data structure like that, but you can create a wrapper around HashSet to suit your needs. Something like this:



    public class MyHashSet

    private HashSet<Integer> set;

    public MyHashSet()
    set = new HashSet<>();


    public void add(int hash)
    set.add(hash);


    public boolean contains(int hash)
    return set.contains(hash);







    share|improve this answer


















    • 1





      I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

      – Steve
      2 hours ago











    • @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

      – Pritam Banerjee
      2 hours ago













    0












    0








    0







    There is no data structure like that, but you can create a wrapper around HashSet to suit your needs. Something like this:



    public class MyHashSet

    private HashSet<Integer> set;

    public MyHashSet()
    set = new HashSet<>();


    public void add(int hash)
    set.add(hash);


    public boolean contains(int hash)
    return set.contains(hash);







    share|improve this answer













    There is no data structure like that, but you can create a wrapper around HashSet to suit your needs. Something like this:



    public class MyHashSet

    private HashSet<Integer> set;

    public MyHashSet()
    set = new HashSet<>();


    public void add(int hash)
    set.add(hash);


    public boolean contains(int hash)
    return set.contains(hash);








    share|improve this answer












    share|improve this answer



    share|improve this answer










    answered 2 hours ago









    Pritam BanerjeePritam Banerjee

    11k64567




    11k64567







    • 1





      I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

      – Steve
      2 hours ago











    • @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

      – Pritam Banerjee
      2 hours ago












    • 1





      I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

      – Steve
      2 hours ago











    • @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

      – Pritam Banerjee
      2 hours ago







    1




    1





    I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

    – Steve
    2 hours ago





    I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

    – Steve
    2 hours ago













    @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

    – Pritam Banerjee
    2 hours ago





    @Steve His main concern was a reference to the objects. Here it is storing only referencing to Integer objects.

    – Pritam Banerjee
    2 hours ago











    0














    There is no such built-in data structure, because such a data structure is rarely needed. It's easy to build one, though.



    public class HashCodeSet<T> 

    private final HashSet<Integer> hashCodes;

    public MyHashSet()
    hashCodes = new HashSet<>();


    public MyHashSet(int initialCapacity)
    hashCodes = new HashSet<>(initialCapacity);


    public HashCodeSet(HashCodeSet toCopy)
    hashCodes = new HashSet<>(toCopy.hashCodes);


    public void add(T element)
    hashCodes.add(element.hashCode());


    public boolean containsHashCodeOf(T element)
    return hashCodes.contains(element.hashCode());


    @Override
    public boolean equals(o: Object)

    @Override
    public int hashCode()
    return hashCodes.hashCode(); // hash-ception


    @Override
    public String toString()
    return hashCodes.toString();







    share|improve this answer


















    • 1





      I think the OP's question wasn't about API, but about memory usage. This doesn't help with that since the result still acts like a HashSet.

      – Steve
      2 hours ago











    • I realize that I was unfair completely on this. I've removed my objections. Feel free to delete your comments on my comments. I still think there's something more to the problem, but I was off base. Sorry

      – Steve
      1 hour ago















    0














    There is no such built-in data structure, because such a data structure is rarely needed. It's easy to build one, though.



    public class HashCodeSet<T> 

    private final HashSet<Integer> hashCodes;

    public MyHashSet()
    hashCodes = new HashSet<>();


    public MyHashSet(int initialCapacity)
    hashCodes = new HashSet<>(initialCapacity);


    public HashCodeSet(HashCodeSet toCopy)
    hashCodes = new HashSet<>(toCopy.hashCodes);


    public void add(T element)
    hashCodes.add(element.hashCode());


    public boolean containsHashCodeOf(T element)
    return hashCodes.contains(element.hashCode());


    @Override
    public boolean equals(o: Object)

    @Override
    public int hashCode()
    return hashCodes.hashCode(); // hash-ception


    @Override
    public String toString()
    return hashCodes.toString();

















    answered 2 hours ago









    Leo Aso





















    You could use a primitive collection implementation such as an IntSet (offered by third-party libraries like fastutil and Eclipse Collections; the JDK has none) to store the hash code values. As others have mentioned, this assumes collisions aren't a problem.
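To illustrate what a primitive int set does, here is a minimal open-addressing sketch in plain Java. It is not the API of any particular library; the class name `IntHashCodeSet` and all of its methods are assumptions made for this example. A real project would more likely use an existing `IntSet` implementation.

```java
/** Minimal open-addressing set of ints; a sketch, not a library replacement. */
public class IntHashCodeSet {
    private int[] slots = new int[16];        // stored values
    private boolean[] used = new boolean[16]; // which slots are occupied
    private int size;

    /** Adds the value; returns false if it was already present. */
    public boolean add(int value) {
        if ((size + 1) * 2 > slots.length) {
            grow(); // keep the load factor at or below 0.5
        }
        int i = indexOf(value);
        if (used[i]) {
            return false;
        }
        slots[i] = value;
        used[i] = true;
        size++;
        return true;
    }

    public boolean contains(int value) {
        return used[indexOf(value)];
    }

    public int size() {
        return size;
    }

    // Linear probing: returns the slot holding value, or the first free slot after its home slot.
    private int indexOf(int value) {
        int mask = slots.length - 1; // length is always a power of two
        int i = (value ^ (value >>> 16)) & mask;
        while (used[i] && slots[i] != value) {
            i = (i + 1) & mask;
        }
        return i;
    }

    // Doubles the table and re-inserts every stored value.
    private void grow() {
        int[] oldSlots = slots;
        boolean[] oldUsed = used;
        slots = new int[oldSlots.length * 2];
        used = new boolean[oldSlots.length * 2];
        for (int j = 0; j < oldSlots.length; j++) {
            if (oldUsed[j]) {
                int i = indexOf(oldSlots[j]);
                slots[i] = oldSlots[j];
                used[i] = true;
            }
        }
    }

    public static void main(String[] args) {
        IntHashCodeSet set = new IntHashCodeSet();
        set.add("hello".hashCode());
        System.out.println(set.contains("hello".hashCode())); // true
        System.out.println(set.contains("world".hashCode())); // false
    }
}
```

Compared with a `HashSet<Integer>`, this layout avoids boxing each hash code into an `Integer` and allocating a separate entry object per element, which is where the memory saving of primitive collections comes from.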
















        answered 1 hour ago









        Mark



































            If you want to track whether a hash code is already present, and to do so memory-efficiently, a BitSet may suit your requirements.



            Look at the following example:



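            import java.util.BitSet;

            public class BitSetExample {
                public static void main(String[] args) {
                    BitSet hashCodes = new BitSet();
                    // BitSet indices must be non-negative, so this works only for non-negative hash codes.
                    hashCodes.set("1".hashCode());

                    System.out.println(hashCodes.get("1".hashCode())); // true
                    System.out.println(hashCodes.get("2".hashCode())); // false
                }
            }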



            The BitSet "implements a vector of bits that grows as needed". It's a JDK "built-in data structure" which doesn't contain "references to the actual objects". It records only whether "the same hash code is inside".



            EDIT:

            As @Steve mentioned in his comment, the JDK's BitSet implementation isn't the most memory-efficient one. There are more memory-efficient bit set implementations, though they are not built in.
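To make the memory concern concrete, here is a small sketch (not from the original answer). `BitSet.size()` reports the number of bits of space currently in use by the set, so setting a single high bit shows how the backing storage grows to cover every index up to that bit. The class name, the helper `storageBitsAfterSetting`, and the chosen index are arbitrary.

```java
import java.util.BitSet;

public class BitSetGrowthDemo {
    // Returns the number of bits of backing storage after setting one bit at the given index.
    static int storageBitsAfterSetting(int index) {
        BitSet bits = new BitSet();
        bits.set(index);
        return bits.size();
    }

    public static void main(String[] args) {
        int index = 1_000_000; // stand-in for one large, non-negative hash code
        int storageBits = storageBitsAfterSetting(index);
        System.out.println("bits set: 1, storage bits: " + storageBits);
        System.out.println("approx. bytes: " + storageBits / 8);
    }
}
```

A hash code close to `Integer.MAX_VALUE` would push the backing storage toward 2^31 bits, roughly 256 MiB, for a single entry, and a negative hash code is rejected with an `IndexOutOfBoundsException`.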































            • I don't know how a BitSet stores individual bits. Since obviously this usage will spread bits across a very large input domain, are you sure those are stored efficiently? Just asking. The naive assumption is that the structure would be an array of bytes, where the array was just expanded to include any bit position and all positions before that, which would be monstrously inefficient. But I don't know how it actually represents bits spread way apart.

              – Steve
              2 hours ago











            • It appears that your solution won't work. See github.com/brettwooldridge/SparseBitSet

              – Steve
              2 hours ago











            • @Steve You're right. I additionally found this post. But the idea of a bit set is basically not bad; the problem is rather the JDK's BitSet implementation.

              – LuCio
              1 hour ago
























            edited 1 hour ago

























            answered 2 hours ago









            LuCio





























